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Topic Outline 


The following topic outline necessarily contains technical termi- 
nology. Readers who have not yet reached a level of expertise 
sufficient to understand all of the terminology in this section may 
be aided by reading the Commentary on the Course Description, 
but a brief study of some computer science texts employing such 
terminology would be an even better aid. 

The topic outline has been reorganized and revised from the 
previous edition of the course description. Additions are shaded, 
but topics that have been moved to other locations in the outline 
are not shaded. See the enclosed special announcement. 


AP Computer Science Topic Outiine 


“A. Programming Methodology 


1. Specification : 
a. Problem definition and requirements 
b. Program and subprogram specifications (e.g., pre-/ 
postconditions, exceptional conditions) 
c. Abstract data types 
2. Design 
a. Decomposition 
i. Criteria (e.g., adaptability, minimizing depen- 
dencies) 
ii. Information hiding 
b. Top-down versus bottom-up methodologies 
c. Stepwise refinement of subprograms and data struc- 


tures 
d. User interface (e.g., error checking, help facilities) 
3. Coding 


a. Structure 
b. Style and clarity of expression (e.g., mnemonic iden- 
tifiers, indentation) 





4. Program correctness 


5. 


a. Testing and debugging 
i. Relation to design and coding 
ii. Generation of test data 
b. Verification 
i. Assertions and invariants - 
ii. Reasoning about programs 
Documentation 


B. Features of Programming Languages 


E 


ae 


a 


alli 


Data 
a. Types and declarations 


b. Scalar data types (e.g, Boolean, character, integer, 


real, subrange, enumerated) 
c. Structured data types (e.g., arrays, records, sets, | 
strings) . 

Block structure and scope of identifiers 
Expressions . 

a. Prefix and postfix notation 

b. Infix notation and operator precedence 
c. Standard functions 

Assignment statements 

Control structures 

a. Sequential execution 

b. Conditional execution 

c. Loops 

Input and output 

a. Terminal 

b. Other files 

Subprograms. 

a. Procedures and functions 

b. Parameters 

i. Actual and formal 
ii. Value and reference 
c. Recursion 
Program annotation (comments) 


Notation for language definition (e.g., syntax diagrams, 


Backus-Naur form) 
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Data Structures (alternative representations, operations, and 
applications) 


1. Linear 
a. Variations 
i. Lists 
ii. Stacks 


iii. Queues 
b. Representations 
i. Sequential 
ii. Linked (singly and doubly; circular; with and 
without list heads) 
2. Tree structures 
a. Terminology 
b. Binary tree 
i. Representations 
ii. Operations (e.g., insertions, deletions, travers- 
= als) 
3. Other linked structures (e.g., linked representation of 
sparse matrices) 


D. Algorithms 

1. Analysis of algorithms 

a. The meaning of “big-O” notation 
b. Worst-case time 

c. Worst-case space 

2. Searching 

a. Sequential (linear) search 

b. Binary search 

c. Hash-coded search 

d. Searching an ordered binary tree 
e. O(n) versus O(log n) searching times 


Sorting 
a. Simple sorts (e.g., insertion sort, selection sort, bub- 
ble sort) 


b.: Other sorts (e.g., merge sort, quicksort, radix sort) 
ce. O(n?) versus O(n log n) sorting times 
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4. Numerical algorithms 
a. Approximations (e.g., zeros of functions, Monte Carlo 
method) 
b. Numerical accuracy 
i. Round-off effects 
ii. Precision of approximations 


E. Computer Systems 


1. Major hardware components 
a. Primary and secondary memory 
b. Processors 
c. Peripherals 
2. System software 
a. Language translators 
b. Operating systems 
c. File systems 
.3. Types of systems 
a. Single-user systems 
b. Time-sharing and batch processing systems 
c. Networks 


F. Responsible Use of Computer Systems 


1. Privacy 

2. Reliability of systems ~ 

3. Legal issues and intellectual property 

4. Social ramifications of computer applications 


Note. Thorough coverage of topics E and F would require more 
time than is available in a single course. In an AP computer science 
course, a minimum treatment of these topics would consist of the 
following. For topic E, a working knowledge of the major hard- 
ware and software components of computer systems should be 
developed, although the operation of these components need not 


be studied in detail. For topic F, the importance of considering 


the ethical and social implications of computing systems should 
be stressed throughout the course, although no systematic cov- 
erage of these implications is required. 
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Some typical application areas studied in AP computer science 
paue the following. 


1. Text processing 
i a. String processing 

b. Editors 
c. Text formatters 
Simulation and modeling 
a. Continuous simulation of physical processes 
b. Discrete simulation of probabilistic events 
Data analysis 
a. Statistical packages 
b. Graphical display of data 
Data management 
a. Data base management systems and report generators 
b. Spreadsheets 
System software 
a. Mail systems 
b. Parsers 

i. Command scanners 

ii. Evaluation of arithmetic expressions 
c. Interpreters 
Games and puzzles (e.g., tic-tac-toe, Tower of Hanoi) 
Graphics 
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bommentary on the Course Description 


he following commentary was prepared by members of the AP 

omputer Science Development Committee to provide back- 

round information for teachers and school districts planning to 
Birer an Advanced Placement course in computer science. Since 
many teachers and other school personnel may not be completely 
eae with the nature and subject matter of college-level 

ourses in computer science, this commentary presents the phi- 
losophy behind the AP course and behind various topics in that 

ourse. A more thorough treatment of these topics may be found 
in the works listed under References. 
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Programming Methodology | 


Programming methodology involves 
a) disciplined technique of problem solving; 

b) data structures; 

c) algorithms; — 

d) analysis of algorithms; . 

e) programming language; svt ; ae 
_ {f} program testing, verification and debugging; and at 
- g) program documentation DOPE i is A ; 


-. In order to solve complex problems, it is necessary to know and implement 
characteristits and representations of different data structures, algorithms and pro- 
gramming languages. Implementation is simplified when it is considered as part of 
an overall, disciplined approach to problem solving, an approach which includes 
problem specification, top-down design, structured programming, testing, verifica- 


f tion, debugging and documentation. 


1. Specification 


The first step in a solution process is to understand what the problem is that 
you must solve. What input and output are required? What is the purpose of the 
program that you will write? Give a precise statement of the problem in terms of 
the input/output (I/O) that clearly identifies the tasks which a program must fulfill 
to solve the problem. For example, to state that an architect must design a house is 
not an adequate specification. You should also indicate the size of the family, 
income level, types and number of rooms and other details which will enable the 

„architect to complete a design successfully. 
You may not really understand a complex problem Well enough to give a 
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Review of Selected Topics 


specification. Try a variety of problem solving techniques until you find one which 
works. Some of the more common ones are: 

« exhaust all possibilities 

+ divide and conquer 

+ use a simulation 

* specify a number of cases 

* relate the problem to something familiar or known 

+ solve a simplified version of the original problem 


nas 2. Design 


Once a specification is determined, then a design is needed to achieve a pro- 
gram which meets the specification. Top-down design is a recommended plan of 


attack. Its objective is a main program consisting of procedure calls such that each. 


procedure solves one part of the original program. If any of the called procedures 
are too complex (i.e., accomplish more than one task) or are too long (over a page 
in length, approximately) then subprocedures should be used. With this kind of 


modularization, the resulting program should be clear enough for a reader to un- 


derstand how it works. 

Part of top-down design is a technique called “stepwise refinement,” by which 
the problem is decomposed into a logical set of subproblems or modules which, 
when solved, will solve the original problem. If any of the modules are simple 
enough, they can be converted into code; otherwise a module will be further 
decomposed or refined. Eventually, all modules are simple enough to encode and 
the entire program can be written. 

The top-down design process proceeds from general to particular. Details, 
such as the choice of algorithm or data structures, are deferred to the lowest level 
possible. Initial states are program language independent. Design changes, when 
necessary, are easier because their effects on other parts of the solution are 


evident. 


A complex design is illustrated by the problem of computing an employee’s 


pay. 
Design modules: 


Step 1. Get record 

Step 2. Compute gross pay 
Step 3. Compute tax 

Step 4. Compute deductions 
Step 5. Compute net 

Step 6. Print pay record 
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Example of level 1 stepwise refinement of step 3. 


Step 3. Compute tax 
3.1 Compute federal tax 
3.2 Compute state tax 


Example of level 2 stepwise refinement of step 3. 


3.2 Compute state tax 
3.2.1 Residence? 
3.2.2 Determine the %. 


A diagram can help to illustrate ‘the stepwise refinement of a top-down 
design. 
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A bottom-up design, on the other hand, would involve working with the 
lowest level, most detailed parts and trying to combine them into a problem 
solution. Details and decisions on algorithms and data structures are made 
early, before the entire solution plan is apparent. Solving complex problems in 
this way frequently leads to inefficient or incorrect programs, excessive debug- 

ging, and no clear-cut division of work when more than one programmer is 
involved. Coding is practically the first rather than the last step in the solution 


process. 
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3. Coding 


i 


When the top-down design process is followed, coding translates the lowest 
design level into a program for execution by a computer. The program should 
reflect the modular nature of the solution design. There should be a direct corre- 
spondence between modules in the design and procedures in the program. Struc- 
tured programming accomplishes this and a language like Pascal supports this 
approach. i 

When top-down design and structured programming are properly imple- 
mented, the end result is a program that is readable, reliable, valid, useful, and 
extensible. Documentation, testing, and modifications are made easier by this 
process. 

Clarity and style are two programming goals which are attainable through a 
structured programming language like Pascal. Procedures and functions allow 
modularity in a program. The declaration clearly identifies named constants, vari- 


ables, data types and structures used. Identifier names can be chosen to indicate . 


their role in a program. Free formatting allows indentation and combining state- 
ments to facilitate reading and understanding. Output can be formatted and la- 
beled so that program results are evident. Comments can and should be used 
liberally; they should identify named constants and variables, beginning and end- 
ing of structures and blocks, beginning of the main program and of each procedure 
or function, and everywhere else that would help a reader understand what actions 
a portion of code will perform or what values are permissible at the beginning and 


ending of structures. 


4, Program Correctness 


If a program does what we intend it to do (i.e., if it meets our specification), 
then we say it is correct. Proving correctness of a complex program is difficult. It 
can be done for simple programs and parts of complex ones. 


a) Testing: : 

Program testing is one way to show that a program is correct, but only for the 
values tested. The better the test plan and data, the more likely the program will 
do what we intend. Top-down design facilitates testing. Test data should be gener- 
ated as modules are created in the design process. Each module, when pro- 
grammed, should be ‘tested separately, then in combination with other modules 
according to the design. Thus both bottom-up and top-down testing are done. 
Expected output of modules.must be supplied to test lower level modules. Test 
data should be chosen to check every path through a module (and combination of 
modules including itself), limiting values such as maxima or minima, number of 
times through a loop (including 0, 1, and N}, parameter values in procedure calls, 
and any other significant aspect of the program. Because it is seldom reasonable or 
possible to check every value (e.g., every possible integer), test data should be 
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chosen judiciously to instill an acceptable level of confidence that the procedure or 
program is correct without spending too much time doing it. 


b) Verification: 

Program verification, as a formal study, involves techniques for proving pro- 
gram correctness. Many of its topics are beyond the scope of an AP Computer 
Science course. While testing provides evidence that a program may be correct 
based on a set of values, verification provides proof for all possible values. 

Two goals of modular programming, as used to implement top-down design, 
are short modules doing only one task and modules with only one entry and one 
exit point. When those goals are achieved, verification is simplified. One-task 
modules are easier to prove correct because they are less complicated. One-entry- 
one-exit modules make it simpler to express all conditions in effect at time of 


entry (preconditions). Program comments can actually be “assertions” about these - 


conditions. A structure (e.g., loop, if-then) can then be considered as a mapping of 
preconditions (domain) onto postconditions (range). If the set of statements in the 
structure or procedure operate on the preconditions and produce the postcondi- 
tions, then that structure or procedure has been proved to be correct. 

Let’s look at a simple example. Consider the one statement structure 


Y:=X°*xX. 


Assume the only precondition is X € Z (read X is a member of Z), where Z is 


the set of all integers, and the only postcondition is Y € N, where N is the set of 


all nonnegative integers. Then the statement (structure) is correct. Any x value 


produces a non-negative integer. If the postcondition had been Y € (N-(0)},.{i-e., y is 
a member of the non-negative integers except 0}, then the statement would not be 
correct because Y is 0 when X is 0. Y = 0 is not an acceptable value. This example 


~~ may seem overly simple but it is not necessarily trivial.’ At some later point in the 


program, Y may be a divisor. 

A comment (* X is any interger *) preceding the statement Y := X * X anda 
comment (* Y is a nonnegative integer; X is any integer *) or (* Y is a positive 
integer; X is any integer *) would be called assertions. Little effort or knowledge was 
required to derive the postcondition from the precondition. In more complex struc- 
tures, rules of algebra, logic and/or set theory may be required to derive the postcon- 
ditions. The process can become very mathematical and extremely complicated. 


c) Debugging: 

Because program verification is not always feasible and program testing does 
not find all errors, program debugging is required. Problem specification, top-down 
program design, ‘structured programming and testing decrease the possibility of 
errors (bugs) being introduced into programs. When these techniques are applied 
with care, the amount of debugging necessary should be extremely small and 
relatively straightforward. Syntax errors (errors which violate the rules of valid 


iff this discussion is too mathematical for you, the point is still quite simple. One-entry- 
one-exit modules make it easy to express the conditions of entry (preconditions) and their 
relationships to the exit conditions (postconditions). If some statements in the module 
change the preconditions to the postconditions, the procedure is valid. If the statements 
cause the preconditions to violate the “exit rule” or postconditions, then program correct- 


ness is in jeopardy. 
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construction) at compile time should be easily remedied and semantic errors fer- 
rors which violate rules which govern meaning) at run time should be easily 
identified. Logical errors (errors which may not terminate a program but which 
cause unintended results) are much less likely to be found when the program is 
properly specified and the solution design is top-down rather than bottom-up. 
Unintended outcomes should not happen in top-down designs, but, if they do, 
their correction should involve relatively few modules. Debugging could be exten- 
sive with the bottom-up approach. Numerous debugging techniques exist and are 


illustrated throughout this book. 


5. Documentation 


The purposes of program documentation are to inform other programmers 
working on the same program or to remind the programmer about matters which 
would facilitate understanding of the program’s purpose and use. Documentation 
may be internal or external to a program. For example, an instruction manual on 
how to use an editor, a word processor, or a set of matrix manipulation routines 

would generally be external documentation. Internal documentation may consist 
of ranges of variables, input values for a numerical. integration procedure or a 
statistical program, variable identification, and other comments. 
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8. Programming Style 


we often appreciate the author's writing for more than just catchy phrasing 
or correct punctuation. A good story works as a whole, with all the pieces fitting together nicely: the 
characters fit the action, the action fits the setting, and the plot is interesting and believable. A good story 
deesn't happen magically; good writers may spend months on short stories and years on novels. They 
start with ideas, develop an outline, write drafts, read them, and revise them until they get them right. 
Writing is a process, a series of separate but related activities, that achieves a goal. “With this view of 
writing as proccess Comes the concept of siyle, or control. For example, when a critic speaks of 
Hemingway's style or Faulkner's style, they are referring to the way the author. controlled the writing 


process to achieve a particular effect in his fiction. 


When we read a good story, 


Similar to the way a good writer controls the process of writing prose or poetry, a good programmer 
controls the process of writing a computer program. Unlike textbooks on English, many programming 
textbooks say little or nothing about style. Many of those that do typically discuss the appcarance or 
format of a program. [n contrast to that view, we consider programming style to be analogous to writing 
style: It isa way or method of controlling. the entire programming proccss, from the initial idea to the 
working program. ‘The programming process includes four separate but related activities: 


1. specification -- what the program should do; 
2. design -- how the program is to do what it should do; 


3. implementation -- coding the design into a programming language; 


4. verification and testing ~ ensuring the program does what it should. 
This chapter will examine cach of these four activities in the order in 
out. For each activity, we will suggest ways that you can dev 
programs casicr to write, casicr to usc, an 
you learn these points and apply them to your programming, 
poor programming practices can create. ; 


8.1 A Style of Program Specification 


The first’step in creating a computer program is to determine wh 
may sccm obvious to you, you should try to remember that what is 0 
occasions, a programmer has written most of the code for a program 


something that was not considere 
house. Ifa new house is nearly completed, the constructor has a real problem if he then fin 


owner wanted two more bedrooms on the second floor. 


A computer programmer should always $ 
solve. If the problem is an assignment for 
virtually all other cascs, the problem statement must be generated by the pro 
who wants the program written. Based on the problem statement, the programmer tl 
called a functional specification. The specification s 
functionality), what input it will take, and what output it will 


specification contains no information on how the program works, only information on what it docs. 
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Just as building a skyscraper is much more difficult than building a four room house, building a large 
computer program is much harder than building a small program. ‘The skyscrapers of programming are 
called sofiware systems, and their builders arc called sofiware enginecrs. Just as an architect creates a - 
blueprint before any phase of construction starts, software engineers (architects) prepare a functional 
specification before starting any phase of program construction. An architect’s blueprint is considered 
good if a builder can build the desired building given only the information contained in the blueprint. 
Similarly, a program's functional specification is considered good if a programmer can build a satisfactory 
program given only the information contained in the functional specification. A functional specification 
which neglects required program features is very poor in the same way as a blueprint which doesn’t 
mention a few rooms of the new building. . . 


We could probably build a dog house without a blueprint, and we could probably write a small 
program without a functional specification. As the size and complexity of a problem increases, the need 
for a functional specification increases. Every programming task can be made casier with a good 
functional specification. You should develop carly on the habit of creating a specification for all your 
computer programs before writing them, 


8.1.1 An-Example Functional Specification 


Consider the program on your computer system that accepts commands from the user to handle such 
tasks as listing files, deleting unwanted files, cte. Such a program is often referred to as the monitor. 
Assume we wish to create a simple monitor program. Our first stcp is to write the functional 
specification. 


) 


"The purpose of the program is to do the file management tasks associated with a monitor. The program 
must maintain a directory or list of files. -Each file has three attributes, name, page size, and deleted 
status. Input will be user commands and is confined to single letters. The following are the required 


commands; 
eL List all files in directory (excluding deleted files) 
eD Delete an existing file 
eU Undelete a deleted file 
eC Create a new file 
eP Print a list of all deleted files in directory 
eÀ Print a list of ALL files (both deleted and not deleted) 
of Exits monitor - storing only files which are not marked as deleted 


‘The output is pretty much as you might expect. ‘There must be a confirmation message for cach 
command, For erroneous commands such as invalid letters, trying to delete a file which doesn’t exist, or 
trying to undelete a file which isn’t deleted, an error message must be given. Each of the three listing 

-commands must have an appropriate header; it must show each file name and its page size on a scparate 
line; and it must give a summary showing the total number of files listed and the total number of pages 
for those files. 


+ 
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Note that we have given the program specification in an informal manner. It is more important to be 
complete than it is to be formal in specifying what a program is to do. Very large programs require a 
formal functional specification more because of their size and subsequent complexity than anything clse. 
Given the informal specification above, you should be able to write a Pascal program satisfying those 


requirements. 
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8.2 A Slyle of Program Design 


on is completed, the programmer knows what the program is to do and 
can turn attention to low the program is to do it. ‘The wrong way to begin is to simply sit down at a 
terminal and start typing in code. If you start this way, you can become bogged down in details and waste 
a lot of time generating code that you won't be able to use later. You should never allow the details to 
dictate the design, and that is what can happen if you approach the programming task the wrong way. In 


chapter 5 we discussed the right way, /op-down design. 


`~ 


Once the functional specificati 


8.2.1 Top-down Design 

Top-down design is a systematic strategy for solving programming problems. ‘The programmer starts 
with the overall problem to be solved and decomposes it into smaller problems (subproblems). Each of 
the subproblems is then decomposed into even smaller subproblems, until all the subproblems are small 
enough to be solved directly by tasks coded into a programming language. The following points 
summarize the benefits of the top-down strategy. For a more complete discussion of this topic see section 
5.1 beginning on page 223 where we give a detailed example for the task of grocery shopping. 


e It moves from the general to the specific, thereby postponing details until last which permits 
the programmer to think at the highest level at all times; 


e It allows the programmer to modify the design of subprograms without affecting other 
subprograms since cach subprogram is independent of others; 


} 


ə Ít is a general problem solving strategy that may be used for problems other than 
programming problems; ; ji 


ə It yiclds a hicrarchical solution that can be graphically depicted with what we call solution 
trees. 


8.2.2 Example of Program Design 


Let's develop an abstract design for the monitor program specified in the previous scction. The 
function of the program is to manage a directory of files. Lets decompose this task into the three 
subtasks of reading directory information for startup, processing users commands on the directory, and 
writing directory informatien back to storage upon program termination. This can be graphically depicted 


with the following solution tree: 


f 


Both read directory and write directory arc simple enough to be directly coded without any further 


’ 
F F 
' 
‘ 
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decomposition. ‘The most complicated module is process directory which should be decomposed further. 
Process directory repeatedly gets i user command and then executes the appropriate command. 
‘Therefore, process directory may be decomposed into get command, and one for each of the commands 
list, delete, undelete, create, list deleted, and fist all. “Wis 
with the following solution trec: 


additional level of decomposition is illustrated 






Ww rile Aitec tory 


Rend Directory 


Each of the modules under process directory are now simple enough to be directly coded in a 


programming language. We need not decompose the monitor program any further. 


Read Directory Process Duecioryy Wide Directory 


1 XS BBS 
i (one) } ae | 
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8.3 A Style of Program Implementation 


Having completed the design for the program, the programmer enters the third phase of programming: 


implementation. In this phase, the programmer takes the abstract design and writes the coded program. -- 


Note that the selection of a programming language may be postponed until this point. Having selected 


the programming language, the programmer may start generating code, the phase which most people - 


think of when someone says they are programming. ‘Though there is no conscnsus as to how the 
implementation phase should be carried out. ‘he method we prefer, advocate, and hope you will adopt is 
a top-down method of refining programs onc step at a time called stepwise refinement. We highly 
recommend this method of implementation. : 


1. Start by writing the main program as a sequence of calls to subprograms corresponding to the 
highest level subproblems in the solution tree. : 


2. Add the declarations for cach of the subprograms called in the main program. You need not 
add the entire body for cach subprogram declared at this point. You can add only the 
subprogram shells (stuks), each containing a WRITELN saying which subprogram is called 
and with what input values. x 


3. Select constants as output for VAR parameters and function values. 
4. Run this partial program to verify correctness of the top level of the design before proceeding. 


5. After confirming the design, fill out the body for cach of the existing stubs, adding calls to 
other subprograms. ; i eae 


6. Add the stubs for the additional subprograms. 


7. Run the modified partial program. 


8. Continue this method of development for cach level of the solution tree until all stubs are 
filled out. Note that you need not fill out al existing stubs in each pass; you can fill out as 
many of them as you arc comfortable with. 


For the monitor program we have been developing in this chapter, the main program would look 
something like the following: 


BEGIN 
ReadDirectory(directory); 
ProcessDirectory(directory); 
WriteDirectory(directory) 
END. 


Note how the subprogram calls in the main program correspond to the highest level subproblems in the 
solution tree. 


8.3.1 Declarations > 


In chapter 5 we discussed a style of declarations for constants, variables and subprograms, and in 
chapter 6 we presented a style for type definitions. We have found these points to be very useful over the 


years: F 
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e All named constants are defined globally to the program and are therefore accessible 
throughout the program, ‘The same cunstant should never be defined more than once ina 
program. ‘The only exception is when some named constant is used by only onc subprogram 
and we wish the constant to be inaccessible by ali other subprograms. “This strategy of keeping 
information inaccessibte from the rest of the program is called information hiding. Although 
information hiding is uncommon in small programs, we have illustrated it with locally defined 


constants before in this text. 


e All user-defined types are defined globally to the program and are accessible throughout the 
program. ‘The reasoning behind this is the same as for named constants. Again the sole 
exception is for information hiding purposes. ‘The authors have never seen a Pascal program 
in which there is some named type only used internally by a single procedure. 


o All subprograms are defined globally to the program. We discourage nested subprogram 
declarations. Ifa subprogram is declared within another subprogram, the nested subprogram 
may be called only within the scope of the subprogram in which it is declared. Remember 
that onc of the motivations for writing subprograms is so that they can be reused instead of 
duplicating the same code. ‘The sole exception for having a nested subprogram declaration is 
again information hiding. ‘This exception is also uncommon in small programs. Additionally, 
nested subprograms permit upper-level addressing which we also discourage. We never use 
nested subprograms in this text other than to illustrate how they work. We hope you will not 


use them cither. 


e All variables are declared local to the subprogram in which they are used. very variable in a 
procedure should either be a local variable to that procedure or a formal parameter to that 
procedure. There are no exceptions whatsoever. 


‘Having already written the main program for monitor, we know we will need a structure to represent a 
directory of files. We need a user-defined type to hold the list of files and also the number of files in the 
list. Let’s use a Pascal RECORD for DirectoryType as indicated below: 


DirectoryType = RECORD f 
files: FileListType; 
size: INTEGER 
END; (* DirectoryType *) 


We also need a user-defined type for the list of files. FileListType will simply be an array of files: 
FileListType = ARRAY [1..MaxFiles] OF FileType; 


Recall that for the purposes of the monitor program, a file has three attributes, which translates into a 
Pascal RECORD with three ficlds as indicated below: 


FileType = RECORD 
name: StringType; 
pagesize: INTEGER; 
deleted: BOOLEAN 
END; (* FileType *) 


Note that we developed the type definitions in a top-down fashion. We defined the largest first, then 
added the necessary smaller subtypes. We advocate top-down design of user-defined types. 
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8.3.2 Inter-Module Communication 


In chapter 5 we pointed out the need to communicate information from one subprogram to another 
subprogram, which we call intermodule communication. In Pascal there are exactly two ways io achieve 


‘such communication: through parameters and upper-level addressing. A variable may be referenced 


anywhere within the scope in which it is declared. ‘Therefore, all variables declared in the global scope of 
a Pascal program may be referenced throughout the entire program. Upper-level addressing occurs when 
a variable reference refers to a variable declared in some enclosing scope, that is, the reference and the 
declaration are in different scopes. You can illustrate upper-level addressing by drawing boxes around a 
program and its subprograms as we did in chapter 5. If you draw a line from the reference to its 
declaration, the line will cross the boundary of the box if it is upper-level addressing. 


Upper-level addressing of variables should never be permitted. All communication between subprograms 
should be made explicit through the use of parameters. Nested subprograms allow upper-level addressing, 
so we discourage such nestings. Even if all subprograms are declared globally (no nestings), variables 
declared globally to a program may still be referenced by upper-level addressing. It is for this reason that 
we also discourage global variable declarations. Remember this rule for variable references and inter- 
module communication: Every variable used in a subprogram must cither be declared in that subprogram 
or passed as a formal parameter to that subprogram. If you follow this rule, your programs will have no 
upper-level addressing; all inter-module communication will be through parameters. You can enforce 
this rule by declaring all subprograms globally (no nested subprograms) and climinating all global 


variables. All global variables and the main program can then be moved to a special subprogram which ` 


we have been calling GlobalVilter. 


. 
° 


You may be wondering about file variables. While it is true that Pascal requires filc variables to be 
declared globally, it is NOT true that Pascal requires upper-level addressing of file variables. File 
variables should be passed as parameters like all other variables. The above rule for inter-module 
communication should be followed for all variables including file variables. All file variables must be 
passed as VAR parameters; they may not be passed as value parameters. All other variables should be 
value parameters; except when you want to change the value of the actual parameter. 


The use of a GlobalFilter may be likened to the use of training wheels on a bicycle: at some point they 
are no longer needed. As you practice the rule stated above, it will soon become second nature. Like the 
ability to ride a bicycle, once learned the rule is never forgotten. 


We introduced functions in a mathematical sense as taking zero or morc inputs and returning one 
result, For this reason, we believe functions should only have value parameters. If we need a 
subprogram to return 3 outputs, use a procedure with 3 VAR parameters as opposed to a function with 2 
VAR parameters. 

Returning now to the monitor program, the first pass at the program looks something like the 
following: : 


File Date: January Ist 1985 at 20:13 Scribe Date: January 2nd 1985 at 00:36 
File Name: CHAPTERSB.MSS.64 line 143 Scribe Version: 4(1400) 


ea mH Ee HE eS F&F & Gs ë 





GTE 


` permanent storage; 


423 


Programming Style 


PROGRAM Monitor (INPUT, OUTPUT, data); fi 
5 / 


VAR 
data: TEXT; 


PROCEDURE ReadDirectory (VAR directory: DirectoryType; VAR from: TEXT); 


BEGIN 
END; "(* ReadDirectory *) 


PROCEDURE ProcessDirectory (VAR directory: DirectoryType) 


BEGIN 
END; (* ProcessDirectory =) 


PROCEDURE WriteDirectory (directory: DirectoryType; VAR to: TEXT); 


BEGIN 
END; (* WriteDirectory >j 


PROCEDURE GlobalFilter (VAR storage: TEXT); 


VAR 
directory: DirectoryType; 


BEGIN 
ReadDirectory(directory, storage); 3 


ProcessDirectory(directory); 
- WriteDirectory(directory, storage) 
END; (* GlobalFilter =) 


BEGIN (* Main Program =) 
GlobalFilter(data) 


END. (* Main Program 


Note thatthe main program is one call to GlobalFilter with the required file variable. The body of 
GlobalFilter is the calls to the three subprograms which correspond to the highest level of program 
decomposition in our design. The procedure ReadDirectory builds the directory structure from 
permanent storage and has two VAR parameters, the directory structure and the file to read from. The 
subprogram ProcessDirectory processcs all the user's commands on the directory structure and has a 
single VAR. parameter. The procedure |WriteDirectory writes the directory structure back out to 
it has two parameters, but only the file variable is a VAR. - At this stage in the 


implementation of the program monitor, we might want to add the user-defincd types and execute the 


partially completed program with the stubs. 
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8.3.3 Comments and Documentation 


A computer program is more than a sequence of instructions to be slavishly read and executed by a 
computer. Programs are read by people as well. Ifa program is exccuted more than a few times after its 
completion, it is almost guaranteed that somcone will want to read the code to fix it, extend it, or just to 
examine it for interest or evaluation. ‘To make your programs casy to read, you should use comments 
liberally. ‘lhe comments in a program are ignored by the compiler, they are added only to aid readers of 


the program. 


Every comment in a program describes some facct of the program. ‘There are three different type of 
comments in a program based on what they describe: (1) statement comments - describe some program 
statement; (2) subprogram comments - describe a subprogram; (3) program comments - describe the 


overall program. 


For program comments and subprogram comments, we have advocated structured comments with 
different fields to organize the different information contained in the comment. Our structured 
subprogram comments contain what the subprogram docs (DESCRIPTION), how the program does it 


(METHOD), what it requires (INPUT), and what it gencrates (OUTPUT). Note that all parameters are 
listed in cither INPUT or OUTPUT or both. Value parameters are in INPUT but not OUTPUT, and 
VAR parameters are in OUTPUT or both INPUT and OUTPUT. 


Program comments at the very Icast should say who wrote the program and when, what the program. 
does, how it does what it docs, what input it needs, and what outpul it gencrates. 


Statement comments describe some particular statement in the program, typically a statement whose 
action is not particularly obvious. A statement comment may also describe a variable declaration, a 
constant definition, or a type definition. We have also used a brief statement comment after cach END of 
a BEGIN-END pair to help determine if it is the END of a THEN clause, an ELSE clause, some 


subprogram, CASE statement, etc. 


Here are some principles you can follow for all comments: 


e Don’t just echo the code. The following is an example of a uscless comment: 


total := total + 1 (* this statement adds 1 to total *) 


e Be sure code and comment agree. Never say code does somcthing in a comment when it does 
something else. 


e Don’t use comments to justify bad code. Bad code should always be rewritten into good code. 
Good code usually needs fewer comments. 


e All comments describe something, whether a subprogram, a statement, or a variable 
declaration, etc. It should always be clear what cach comment describes. 


To develop a habit of using these principles, we suggest you start using comments early. Don’t wait 
until the entire program is coded to then go back and “make up” comments. ‘The program is freshest in 
your mind during coding, and comments can help you during implementation and debugging phases. 
Statement comments should be added when the statement is written, and subprogram comments should 


be written when the subprogram is written. 
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Comments should not get in the way of reading the code. ‘The.rcader of a prograin should never be 
forced to read a program's comments. Program comments are to clarify and provide elaboration for the 
code. ‘The help is to be available when the reader wants it, but someone already familiar with the 
program may not be interested in reading most of the comments and should not have to. The following 


code illustrates the use of structured comments at all levels of the program. 
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PROGRAM Monitor (INPUT, OUTPUT, data); - 


sevesecesersensccsoosscccgseenececccesesosseaseosssoseesesesesoeseleees: 


(* 
* WRITTEN BY: 
* WRITTEN: 

* LANGUAGE: 

: DESCRIPTION: 
* METHOD: 

* INPUT: 


* OUTPUT: 


ensccccncsecncsccesecccssesensassessscesconcesssscesessesseseseseeeeeee e) 


CONST ; 
MaxFiles = 100; Ca maximum number of files in directory structure *) 


TYPE 
FileType = RECORD 
name: StringType; 
pagesize: INTEGER; 
deleted: BOOLEAN 
END; (* FileType, *) 


FileListType = ARRAY [1..MaxFiles] OF FileType: 
DirectoryType = RECORD 
files: FileListType; 


size: INTEGER 
END; (* DirectoryType *) 


VAR 
data: TEXT; 


PROCEDURE ReadDirectory (VAR directory: DirectoryType; VAR from: TEXT); 


(* cecesccncccacccecsecsscssessssseeceseceseseseessesesesessssssesseseesee 


* DESCRIPTION: 
* METHOD: 
* INPUT: 


* OUTPUT; 
eecccccccccccecesesscccsasesssesssesessssessssesssesssssssseesssseeeeee *) . 


BEGIN 
END; (* ReadDirectory *) 


PROCEDURE ProcessDirectory (VAR directory: DirectoryType) 
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fi ‘ {> evoccsscaccccsseccnccsssccccesessesseasevassesecosscsssoosouseses REE 
® DESCRIPTION: 
i © MCTHOD: 
© INPUT: 
tt e 
[ $ QUIPUT: 
EE E E O TR E e a ae a i e sfetelrt* = 
fl BEGIN ; 
END; (* ProcessDirectory a3 
f PROCEDURE WriteDirectory (directory: DirectoryType; VAR from: TEXT); 
| p. E E T OE T a e e a a E E 
* DESCRIPTION: 
| + METHOD: 
% INPUT: 
* OUTPUT: 
i ccbceusanswacuasensqoucescccesccassessnsquabeeengprerene rere ee =< | + <y 
BEGIN 
fi END; (* WriteDirectory =) 
I PROCEDURE GlobalFilter (VAR storage: TEXT); 
VAR x 
directory: DirectoryTyp®; 
f] BEGIN 
ReadDirectory(directory, storage); 
ProcessDirectory(directory); 
WriteDirectory(directory, storage) 
END; (* GlobalFilter *) 
ii BEGIN (* Main Program *) 
GlobalFilter(data) 
{| END. - (* Main Program *) 3 
i 8.3.4 Efficiency 
A topic which comes up sooner or later in all programming courscs is efficiency. We have purposely 
f] said: very little about this topic up to this point. The term efficiency is itself a bit vague. Efficiency refers 
' to the consumption of some resource. ‘The two resources normally associated with a computer program 
; are the computer processing time and the computer memory. However, there is an often overlooked 
ft resource: programmer time. You should understand which of the three resources most affects efficiency, 
because trying to optimize the use of one of them usually costs more of one or both of the remaining two. 
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We have avoided discussing efficiency because 
we term “efficiency hangup 
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all too often a beginning programmer can fall prey to what 
" if exposed to the notion of efficiency too carly. We adhere to the following 


philosophy: ' 


e Make the program simple. 
e Make sure the program is correct. 


o If necessary, make the program faster. 


ating an addition operation here or a subtraction operation 
addition only requires 10 microseconds of computer time, 
addition operations from the program to save only one 
a bit more programmer time to save one second of 
few seconds of CPU time. Resist 


All too often we find programmers climin 
there in the name of efficiency. However, if an 
the programmer would need to remove 100,000 
second of CPU time! It would no doubt take quite 
CPU time [4]. Resist the temptation to twiddle your program to save a 


getting the “efficiency hangup”. 


correct before you try making it faster. A program that doesn’t work is” 
of no value regardless how fast it is. Besides, anyone can make a programs arbitrarily fast if it doesn’t 
have to work. Correctness is a must before efficiency is considered. Once the program is finished and 
you know it works, then consider its speed. Quite often the program runs fast enough, If the program 
must be faster to be acceptable, then and only then should you consider ways to make it faster. 


‘The program should always be 


orams, 5-10 % of a program actually consumes 90% or more of the CPU time. So if you 


In most prog 
want to make a program faster, you should concentrate on that 5-10 %. Unfortunately, programmers are 


inherently poor at guessing which portion uses the most time. ‘The best way to determine it is by timing 
your program. Many systems provide a ‘program profiler which gives information on how many times 
cach subprogram gets called and how much time each takes. With this information, you tan usually 
determine the target code pretty quickly. If you don’t have a profiler, you can add timing and counting 
manually. Most implementations provide access to a system ‘clock (which typically counts in 


milliseconds) that you can usc to time a program and its subprograms. 


Once you determine which portion of the program to speed up, study the algorithm used in that 
portion of the program and try replace it with a faster algorithm. Once the algorithm has been replaced, 
time it again. Since we will examine algorithms and their analyses in the second part of this book, we will 


say no more about efficiency now except for a few reminders: 


e Don’t sacrifice clarity for efficiency. 
o Don’t sacrifice correctness for efficiency. 
e If the program works correctly but it is too slow, then try to make it faster. 


e Avoid the “efficiency hangup”. 


4 


8.3.5 Readability 


We have already pointed out that progrems are read by people rather than just by machines. Since 


people read programs, every programmer ‘should adopt a scheme of program layout which makes his 
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progran casier to read. Just as one uses margins, indentation, blank lines cte to mike letters, reports and 
textbooks more readable, such things should be used to enhance the readability of computer programs. 
Below we list some of things we have already suggested to make your programs more readable: 


o Put cach statement on a separate line by itself, 
e Usc blank lines to separate things which are logically unrelated, ° 


e Usc indentation to show the logical structure of a program. For example, the statements ofa 
subprogram are indented as are the statements of a BEGIN-ENDD block, and the statements of 
a REPEAT-UNTIL loop. Inconsistent indentation can be worse than none at all. 

e Use both upper and lower casc characters if available to make identifiers more readable and 


emphasize keywords. 
e Usc redundant parentheses to avoid ambiguity in expressions. 


acing throughout program. Use the same number of blank lincs between 


e Use consistent sp 
ame number of spaces around operators’and between 


cach subprogram declaration, the s 
clements in lists such as paramcter lists. 


The table below summarizes the formatting conventions we have tried to follow throughout this text: 


a 
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‘Table 8-1; Formatting Conventions 


Blank Lines a 
number before cach subprogram declaration ; DE 
number before cach subprogram comment 
number before CONST 
number before TYPE 
number before VAR 
number before subprogram BEGIN 


eee NN 


New Lines 
put program header statement on a separate line 
put cach subprogram header on a separate line 
put keywords on separate line (CONST, TYPE, VAR, BEGIN, END, WHILE) 
_put cach statement on a separate line 
put cach constant definition on a scparate line 
put cach type definition on a separate line 
put cach variable declaration on a separate line 


Spacing s 
« number before program name, - 

number before program parameter list 
number after cach program parameter 
number before cach subprogram name 
number before subprogram formal parameter list 
number after cach formal parameter 
number after formal parameter type 
number between cach actual parameter 
number before function type 
number between operator and operands 


pmi pad pt ND pt pd paa pd p at 


Indentation : 
amount for constant definitions 
amount for type definitions 
amount for variable declarations 
amount for BEGIN-END blocks (including main program and subprograms) 
amount for THEN and ELSE clauses 7 
amount for statements in REPEAT-UNTIL loop 


aan H DE 


Programs which look nice are nearly always more readable. You should strive to create good-looking 


programs. However, don’t wait until the program is finished and then go back to format the program. 
Keep your program tidy from writing the first line of code. It soon becomes second nature and is no 
chore at all. ‘here are programs which format programs. Such formatters take a program as input and 


generates a formatted program as output. Writing such a program can be very educational, and the- 


resulting program can be very useful. Situations where many programmers work together (software 
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teams) find pretty printers or formatters extremely useful since everyone's code looks just like their own 


and is therefore casier to read. 


‘The following is an example of a poorly formatted program. It is quite difficult to read: 


AR DIRECTORY: DIRFCTORYTYPE ;VAR FROM: TERT); i 
INLIDIRECTORY(DIRECIORY): RI SET(EROM); NUMBER: =0; 
MAXFILES) DO BIGIN NUMBER:=NUMBER+1;WITH 
AML) ;READ( FROM, PAGESIZE); 


PROCEDURE READDIRECTORY(V 
VAR NUMBER: INTEGER; BEGIN 


WHILE NOT( EOF (FROM) AND (NUMBERS 
DIRCCIORY. FILCS[NUMBER] DO PEGIN READNAME (FROM, N 


READLN( FROM) END END; DIRECTORY .SIZE:=NUMBER END; 

Now let's look at the exact same program which utilizes blank lines, indentation, upper and lower case, 
cte to improve its readability: 

PROCEDURE ReadDirectory (VAR directory: DirectoryType; VAR from: TEXT); 


VAR 
number: INTEGER; (* used to count number of files in directory *) 


BEGIN 


InitDirectory(directory); (* initialize directory structure a} 


(* open file for reading *) 


RESET(f rom); 
number := 0; (* initialize number of files read *) 
j WHILE NOT(EOF(from) AND (number < MaxFiles) 00 


BEGIN 
number := number + 1; 
WITH directory.files[number ] DO 


BEGIN 
ReadName(from, name); 


READ( from, pagesize); 
READLN( from) 
END (* WITH *) 
END; (* WHILE *) 


directory.size := number (* store total number of files read *) 


END; (* ReadDirectory *) 


It truly is much more readable. 


8.3.6 Understandability 


adable program is very desirable, however, 
difficult to understand. Understandable programs 
ogram understandability as indicated below: 


A good-looking re it is not cnough. Well-formatted 
programs may still be are even more desirable. There 
are a few things we can do to increase pr 


o Make code simple. Don’t try to be overly clever. Strive for simple and direct programs. 

e Use mnemonic identifier names. We have said this many times before but a program which 
uses names such as a, b, and c instead of total, revenues and profit simply is less 
understandable. Every time we sce b we must think that means total. Why not just the 


identificr total? 


column, columns, colmn, 


If a procedure has six local variables, 
dentifiers which 


e Use distinct variable names. 
be difficult to understand. Avoid using i 


colmns, col, and cols, is going to 
differ in only a few characters. 
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e Use prefixes and suffixes for identifier names. We normally use the suffix Type for all 
user-defined types (like Directory Type and FileType). 

e Group logically related procedures together. In languages which support modules or units, 

put logically related subprograms together in a separate module. 


Consider the following procedure declaration: 


PROCEDURE I1 (VAR I2: 13; VAR 14: TEXT); 


VAR 
15: INTEGER; 


BEGIN 
16(12); 


RESET(14); 


16:5: =-0; 
WHILE NOT(EOF(14) AND (15 < 17) DO 
BEGIN 
I5 := 16 + 1; 
WITH 12.18[15] DO 
BEGIN 
19(14, 110); 
READ(14,. 111); 
READLN( 14) 
END À 
END; : : 


12.112 := 16 
END; 


It’s difficult to understand isn’t it? Do you recognize it? It’s the procedure ReadDirectory from the 
previous page. We simply replaced all the identifiers with nonmnemonic identifiers (11, 12, 13, etc). 
Picking meaningful identifiers really does have an effect on readability and understandability. 


8.3.7 Example of Program Implementation 


Below we give the monitor program with nearly all the stubs refined into working Pascal code. 


/ 
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PROGRAM Monitor (INPUT, OQUIPUT, data); 3 


CONST 
MaxFiles = 100; (* maximum number of files in directory structure $) 
TYPE 4 ; i 
FileType = RECORD chy See 


name: StringType; 

pagesize: INTEGER; 

deleted: BOOLEAN 
END; (* FileType *) 


Filelistlype = ARRAY [1..MaxFiles] OF FileType; 


DirectoryType = RECORD 
files: FileListType; 


_ size: INTEGER 
END; (* DirectoryType s) 


VAR z 
data: TEXT; 
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PROCEDURE ReadDirectory (VAR directory: Directorylype; VAR from: TEXT); ~ 

VAR . : l 
number: INTEGER; (* used to count number of files in directory *) 

BEGIN : 
InitDirectory(directory); (* initialize directory structure *) i 
RESLT( from); (* open file for reading *) | 
number := 0; (* initialize number of files read *) | 
WHILE NOT(TLOF(from) AND (number < MaxFiles) DO : 

BEGIN 4 
number := number + 1; a 
WITH directory.files[number] DO ‘ . 
BEGIN 
ReadName(from, name); 
READ(from, pagesize); 
READLN( from) p 
END (* WITH *) 
END; (* WHILE *) 
directory.size := number (* store total number of files read *) I 

END; (* ReadDirectory *) 

PROCEDURE WriteDirectory (directory: DirectoryType; VAR to: TEXT); : i 

VAR 
index: INTEGER; 

BEGIN ; 3 - : = 
REWRITE(to); (* open file for writing *) (| 
FOR index := 1 to directory.size DO : 

“WITH directory.files[index] DO 
BEGIN 
WriteName(to, name); 1 
WRITE(to, pagesize); 
WRITELN( to) 
END (* WITH *) l 
END; (* WriteDirectory *) 
a. | 
| 
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FUNCTION GetCommand: CHAR; 


CONST 
prompt = "Monitor>'; 


VAR : 
cmd: CHAR; es E 


BEGIN 
WRITE(OUTPUT, prompt); 
READLN(OUTPUT, cmd); 
GetCommand := cmd 

END; g" GetCommand *) 


PROCEDURE ListAll (directory: Directorylype); 


VAR 
index: INTEGER; 
PageTotal: INTEGER; 


BEGIN : 
WRITELN(OUTPUT, ‘File Pages Status’); 


PageTotal := 0; 
FOR index := 1 TO directory.size DO 


WITH directory.files[ index] DO 


BEGIN > 
WriteName(OUTPUT, name); 


WRITE(OUTPUT, pagesize); 


IF deleted : 
THEN WRITE( OUTPUT, "DELETED' ) 


ELSE WRITL(OUTPUT, "NOT DELETED"); 
- WRITELN( OUTPUT); $ 
PageTotal := PageTotal + pagesize 
END; (* WITH *) 
WRITELN(OUTPUT, PageTotal, ‘pages in ', directory.size, ‘fites.*) 
END; (* ListAll *) 


A) 
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PROCLDURE List (directory: DirectoryType}; 


VAR 
index: INIEGER; s 
Pagelotal: INTEGER; 

BEGIN 


WRITELN(OUTPUT, ‘File Pages"); 


Pagetotal := 0; 
FOR indox := 1 TO directory.size DO 
WITH directory.files[ index] DO 
1F NOr(deleted) 
THEN BEGIN 
WriteName(OUTPUT, name); 
WRITE (OUTPUT. pagesize); 
WRITELN(OUTPUT); - 
PageTotal :* PageTotal + pagesize 
END; (* THEN *) 


WRITELN(OUTPUT, PageTotal, "pages in ', directory.size, 
END; (* List *) 


PROCEDURE ListDeleted (directory: DirectoryType); 


VAR 

index: INTEGER; 

PageTotal: INTEGER; 
BEGIN : 2 
WRITELN(OUTPUT, ‘File Pages’); 


PageTotal := 0; 
FOR index := 1 TO directory.size DO 
WITH directory.files[ index] DO 
IF deleted 
THEN BEGIN 
WriteName(QUTPUT, name); 
WRITE( OUTPUT, pagesize); 
WRITELN( OUTPUT ) ; 
PageTotal := PageTotal + pagesize 
END; - (* THEN *) 
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'files.') 


WRITELN(OUTPUT, PageTotal, ‘pages in ', directory.size, ‘files.") 


END; (* ListDeleted *) 


oal | 


File Date: January Ist 1985 at 20:21 
Kile Name: CHAPTERSC.MSS.44 line 401 


Scribe Date: January 2nd 1985 at 00:36 
Scribe Version: 4(1400) 








Programming Style 437 


PROCEDURE Delete (directory: DiroctoryType); 


BEGIN 
END;  (* Delete *) 


PROCEDURE UnDelete (directory: DirectoryTypo); Bee ate 


BEGIN 
END; (* UnDelete *) 


PROCEDURE Create (directory: DirectoryType); 


BEGIN 
END; (* Create *) 


PROCEDURE Exit; 


BEGIN 
END; (* Exit *) 


PROCEDURE ProcessDirectory (VAR directory: DirectoryType); 


BEGIN 
REPEAT 
command := GetCommand; 


| ee ee List(directory); 
nl 5 arna, ea Delete(directory); 
A eee RET UnDelete(directory); 
of a pe Create(directory); 
Pep ListDeleted(directory); 
E a e ListAll(directory) 
eee fe? Exit; 
OTHERS: GiveHelp 

END; CEASE SY - 

UNTIL (command = 'E') OR (command ='e') 
END; (* ProcessDirectory +) 


PROCEDURE Globalfilter (VAR storage: TEXT); 


VAR 
directory: DirectoryType; 


BEGIN 
ReadDirectory(directory, storage); 
ProcessDirectory(directory); 


WriteDirectory(directory, storage) 
END; (* GlobalFilter =) 


BEGIN (* Main Program a | 
. GlobalFilter(data) 
END.  (* Main Program *) 
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8.4 A Style of Program Verification 


After we finish implementing the program we enter the final phase of programming. verification and 
testing. This phase is to determine two things: (1) Does the program do everything it is supposed to? and 
(2) docs the program do everything correctly?. 


juestion is by comparing the program with the functional specification 


‘The way answer the first € 
(generated in phase 1) as well as the original problem statement. So long as no required features got lost 


somewhere during the design or implementation phases the program should be functionally acceptable. 
If for example, we neglected to write all directory back out to storage or neglected to provide the ability to 
list deleted files option we have uncovered an unsatisfactory result 


Answering the second question is much harder. A program may appear to work and yet we still are 
unsure if it always works correctly under all circumstances. There is a way to prove that a program works 
‘This method is very formal and requires the use of a specialized language know 
ficult (most think it much more difficult than programming), 
tedious and time-consuming virtually no one proves their program correct in a formal sense. ‘The most 
common way to verify a program is with testing. The problem with testing is that the test only tells us if 
the program works for the particular test data we use. Therefore to completely test a program, we must 
give it all possible test data. But if a program requires only one integer, there arc already infinitely many 
possible inputs to the program. Even when the possible inputs arc finite they are often too many to try 
them all. Hence it is usually impossible in a practical sense to completely test a program We must scttle 
for a subset of all possible test data. The trick then becomes picking a reasonable set of test data so that 
we are reasonably sure that the program works. W have some suggestions for selecting test data: 


correctly in all situations. 
as predicate calculus. Because is is dif 


ə Test common valid input. For monitor program we would at least want to pick all 6 of the 
valid commands, L, D, U, C, P, A, E. 


o Test fringe or boundary data. If we were writing a program to test integers for primality, we 
should definitely choose 2 as a test case since it is the smallest prime number. Similarly we 
should select the integer 1 since it is on the other side of the boundary. When we have 
numeric input it is a good bet to try 1,0, and — 1. Also consider the boundary conditions for 
data structures, like 0 elements in an array as well as the maximum allocated. 
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o Test clearly invalid data. For example, in our monitor program we would want to test some 
invalid command such as x, y, and 7. We would also want to try such things as deleting 
nonexistent files, deleting already-deleted files, or creating already-existing files. When we 
have numeric input we must always try other positive and negative numbers. 


Consider a function to determine if an integer is odd orcven. An integer is even if there is no 
remainder when divided by 2 and odd otherwise. Consider the following Pascal function to 


determine odd or even: 


FUNCTION Odd (number: INTEGER): BOOLEAN; 


BEGIN 
odd := (number MOD 2) = 1 


END; 


The above function returns ‘TRUE if the remainder after division by 2 is 1 and FALSE 
otherwise. For 7, 7 MOD 2 gives 1 hence odd. Similarly, 8 MOD 2 gives 0 hence not odd. 
But what about negative input? —7 MOI) 2 yields — 1 rather than 1 hence the function 
returns FALSE. But we know — 7 is just as odd as 7 is. To correct the above function we 


should change it to: 
FUNCTION Odd (number: INTEGER): BOOLEAN; 


BEGIN 
odds := (number MOD 2) <> 0 


END; 


‘Consider the following function: 


FUNCTION UpperCase (ch: CHAR): CHAR; 
‘CONST 
CaseDiff = 32; 


BEGIN 
IF (ch >= 'a') AND (ch <= '2') 
THEN UpperCase := CHR(ORD(ch) - CaseDiff) 
ELSE UpperCase := ch 
END; (* UpperCasa *) 


t u 


We will want to try ‘a’ and ’z’ since they are boundary values. We need only try one value outside the 
range: we necd not try all remaining values, all uppercase, all digits, all punctuation, ctc. Often times 
input falls into vo or more classes. If it works for one value in class, we needn't check any others in the 
class. the above function has two classes, those in the range ’a’.. 'z’ and those which aren’t. Hence we 
could pick ‘d’ and °D’ and be sure, it always worked of worked for those two. We would still pick the 


boundary conditions out of habit. 
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8.5 Debugging 


In testing your program you will undoubtedly uncover at least one bug. Itis not unnatu ral to find bugs 
in your program, in fact, it is in unnatural to not find them. Computer programming requires an 
inordinate amount of attention to detail to which we as humans simply are unaccustomed. It is relatively 
easy to find bugs, it is much more difficult to fix them. We must first identify what is wrong. We must 
then determine which portion of the program is causing the bug. And lastly we must determine what to 


do to rectify the situation. 


Most programming systems now provide a debugger for debugging programs. Debuggers typically 
provide the facility to trace a program’s execution, the ability to query the values of variables. A good 
debugger can not be surpassed in aiding in the location and correction of bugs. If no debugger is 
available on your system you will need to add statements to your program to locate bugs. ‘The natural 
thing is to simply add WRITELN statements to your subprograms printing out input values and key 
variables. However, after the bug is located and fixed we no longer need the debugging output. Instead 
of just throwing away the debugging WRITELN’s we suggest using conditional debugging statements as 
indicated below: 


IF debugging 
THEN WRITELN(OUTPUT, debugging stuff goes here) 


Where debugging is a named constant. That way if we later need to debug a subprogram again we 
simply need to change the valid of debugging to determine of debugging W RITELN exccutes. We have 
found a variant of the above debugging even more useful: 


IF ProcDebugging OR PgmDebugging 
THEN WRITELN(OUTPUT, debugging stuff goes here) 


Note this has a compound condition. There is a named constant for the procedure and one for the 
whole program. That way we can get the debugging information for just one procedure alone or we can 
get it for all of them by simply changing the value of the global constant. 
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t #1: Programming Methodology 


Directions: For each of the following decide which is the best choice. 


—_, a =m = 


1. Which of the following is (are) advantages of a high-level programming lan- 
guage? 


I. Eliminates the need for tedious machine-code programming 
Il. Easier to understand and modify programs 
Ill. Deciding which program to run next for the system 


(A) I 

(B) I 

(C) II 
(D) I&II 
(E) I& M 


2. Which of the following is (are) among the legitimate purposes of software 
documentation? - : 


I. To assist in maintenance and modifications. 
Il. To describe'the capabilities of the program. 
Ill. To:provide the user with instructions. 


(A) I only 

(B) I & Ill only 
(C) I & Ill only 
(D) II only 

(Ej I, II and Ill 


3. Which of the following is not an example of program documentation? . 


I. Algorithm 
Il. Identifier names 
Ill. Specifications 
IV. Object code 


(A) I 
(B) I 
(C) 1 & Ill 
(D) IV 
` (E) all of the above 


ee? 
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4. The main job of a compiler or interpreter is to 
(A) turn algorithms into programs 
(B) tell the computer exactly what to do 
(C) provide documentation for a program 
(D) translate a high-level language into machine language 
(E) allocate processing time 


5. All of the following are essential to stepwise refinement except 


I. stating the problem simply £ 
I. decomposing the problem into subproblems 
Ill. refining the subproblems 
IV. declaring all variables 

V. inputing the data 


(A) T& IV 

(B) IV 

(C) IV&V 
_{(D) Vv 

(E) I&II 


6. A top-down approach to programming calls for 


I. working from the general to the specific 
II. postponing minor decisions : 
Ill. a systematic plan 

IV. immediate coding of the problem 


(A) 1 & Ill 

(B) IV 

(C) I, II and III 

(D) I 

(E) all of the above 


7. Top-down programming facilitates 


I. implementing a complex problem in stages 
II. testing of individual modules 

III. readability 

IV. maintenance of the program 


(A) I 

(B) I&I 

(C) 1, 2 & Il 

(D) II & Ill 

(E) all of the above’ 


_ 8. You are required to write a program to find the average of the class grades on 
your last A.P. Computer Science test. The best thing to do first is 
(A) input the data 
(B) add the grades 


@? 





11. 


12. 
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(C) ask for names 
(D) write your program 
(E) write an algorithm 


If all run-time errors are removed, which of the following may prevent your 
program from performing its function? 

(A) passing illegal parameters to standard functions 

(B) undefined variables 

(C) stack overflow 

(D) logic error 

(E) dividing by zero 


A loop which averages grades that can range from 60 to 100 begins 
i i while Grade <> 65 do. 

A better loop condition to accept the necessary grades is 

(A) while Grade < 100 do 

(B) while Grade > 60 do 

(C) while Grade = 60 do 

(D) while Grade >= 60 do 

(E) none of the above 


Repeated execution of simple computations may cause compounding of 
(A) syntax errors 


_ (B) run-time errors 


(C) roundoff errors 
(D) logic errors 
(E) none of the above 


` Flightly Air Express.charges by integer values of weight. Charges increase for 


weights greater than 2 pounds and again for weights greater than 5 pounds. 
The best minimum set of data to test a program which calculates charges 
would be ; 


Questions 13 and 14 are based on the following program fragment: 


Twos := 2; 
while Twos < 100 do 
begin 
writeln (Twos); 
Twos := Twos * 2 
end; (*TWOS TABLE‘) 


13. The last value printed is 


(A) 100 
(B) 128 
(C) 64 
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14, 


I5: 


16. 


Iz: 


18. 





(D) 2 
(E) none of the above 


The last value assigned to Twos is 
(A) 100 

(B) 128 

(C) 64 

(D) 2 

(E) none of the above 


The “Sweets” wholesale candy company sells candy for $2.00 a pound if you 
order from 1 to 10 pounds, $1.75 a pound if you order from 11 to 20 pounds, 
and $1.60 a pound for orders more than 20 pounds. What would be a mini- 
mum set of test data to make sure a program to keep track of orders is 
working properly? < 

(A) 5, 10, 15, 20 

(B) 1, 11, 21 

(C) 3 135.20, 21 

(D/O): 30; 20 — =<. 

(EL 1,5, 16; 11,- t520, 21, 25 


If the “Sweets” company changes their policy and will not accept an order 
less then 5 pounds, and the program is modified to account for this, what 
would be a good set of test data to check the new program? 

(AJ 5, 10, 15, 20 - CE ~ 

B- 175, EL 21 ; 

(C) 5, 6, #1, 21} 

(D} 0, 5, 10, 20 


{E) 1, 4, 5, 6, 10, 11, 20, 21 


Thorough testing of a program will ensure that it performs as expected. A 
program should be tested 


I. for each module in the program 
II. with sample data 
III. for every path through the program. 


(A) I 

(B) I 

(C) I 

(D) I&II 
(E) I, I, & II 


Structured coding involves constructing programs with 


I. sequential statements 
II. conditional statements 
Ill. iterative statements 


(A) I only 
B) IQ Il | 


Qa == ae meeps E a ee = 


19. 


20. 
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(C) I&II 
(D) If & Il 
(E) L 0, & Wl 


Modular programming involves 


- I. modules that solve individual tasks 


II. problems decomposed into logical subproblems 
Ill. proceding from the particular to the general 


(A) 1&ll 

(B) I & Il 

(C) 1 & Il 
{D} III only 
EI I, & Il 


When several programmers are involved with one design, which of the fol- 
lowing is likely to cause the most problems? 

(A) Modular programming 

(B) Top-down design 

(C) Step-wise refinement 

(D) Bottom-up design ` 

(E) Structured programming 
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AP 1.28 AP Supplement Chapter One 


Chapter AP 1 Test 


PART I Choose the best answer for each of the following questions. 


—— 


I. 


An "assertion" is 


a) a Boolean condition. 

b) a declaration about the current status of a program. 

c) any comment in a program. 

d) a comment which describes the inputs and outputs of a procedure. 
e) a method. of proving that a program has no bugs. 


An “include file" can be used to 


a) store data or text for input to a program. 

b) store data or text which is output from a program. 

c) store a complete definition of an abstract data type. 

d) keep track of which modules must be recompiled after a program is 
modified. 

e) list all modules needed by a particular program. 


A "subprogram" is 


a) any program stored in an include file. 

b) a group of procedures and functions which perform operations on an 
abstract data type. 

c) any recursive procedure or function. 

d) any procedure or function which performs an operation whine an 
abstract data type. 

e) any procedure or function. 


When a procedure generates an "exception" 


a) the procedure should be rewritten as a function. 

b) it should record the fact, either by notifying the user or modifying a 
Boolean variable. 

c) the exception should be stored in an array or disk file only. 

d) it has executed the ELSE section of an IF-or CASE statement. 

e) Procedures cannot generate exceptions, only functions can. 


The simplicity rule says that 


a) all subprograms should not exceed a certain length. 

b) all subprograms should perform a single, well-defined task. 

c) subprograms with parameter lists should only have a limited number of 
closely related parameters. 

d) (a) and (b) only. 

e) (a), (b) and (c). 


Information hiding requires that 


a) the details of the construction of a particular subprogram has no effect 
on any other subprogram. 
b) individual data items are hidden as the fields in a record. 


c) new data types be created. 
d) integer values be stored in string variables and converted when needed. 


c) subranges be used to limit the range of possible values, 
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A low-level subprogram 


a) is built-in to the compiler, such as trunc or chr. 

b) must be written in assembly or machine language. 

c) makes use of the construction of an abstract data type in order to 
perform some operation with or on it. 

d) manipulates some global variable. 

e) is any subprogram which uses an abstract data type in any way. 


A "precondition" is 

a) a description of the conditions which must be true at the beginning of 
a subprogram for that subprogram to work correctly. 

b) the initialization of certain program variables (notably arrays) to 
insure that undefined values are not used in calculations. 

c) code that checks the value of any input parameters to a subprogram. 

d) code which checks the value of a subprogram’s output parameters 
before calling another procedure or function. 

e) the first comment in a program, which describes the purpose and 
actions of that program. 


Which of the following is a proper assertion for the actions performed by 
the code shown below? 


repeat 
<statements> 
DDEIL X > 8s 


a) (* X <= 8 *) 
b) (* X is greater than or equal to 8 *) 
c) (* X is less than or equal to 8 *) 

d) (* X > 8 F) - 

e) (* X is undefined at this point *) 


Which of the following is a proper assertion for the actions performed by 
the code shown below? 


Pos := 1; . 
while (Pos < MaxPos) and (Line[Pos] = ’ ’) do 
Pos z=. Pos=t217 : 


a) (* Counts the number of blanks in LINE *) 
b) (* Pos < MaxPos *) 

c) (* Line[Pos] <> ’ ’% *) 

d) (* Pos < MaxPos and Line[Pos] <> ’ *% *) 
e) (* Pos >= MaxPos or Line[Pos] <> re ae 


A global Boolean parameter named Debug may be used to 


a) turn the compiler’s debug checking on and off. 

b) produce visual checks of postconditions. 

c) turn the printing of debugging information on or off by enclosing 
such checks in an IF statement beginning with: IF Debug ... 

d) determine if an error has occurred: Debug is TRUE when this has 
happened. 

e) have the compiler check if the index to an array is within the 
‘declared minimum and maximum limits. 
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14. 


15. 


16. 


17. 
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When information hiding is used 


a) programs are modular and it is easy to reuse subprograms in different 
programs, 

b) low-level subprograms may be modified without changing any other 
code. š 

c) programs are more reliable because it is easier to test the individual 
subprograms without having to worry about unanticipated interactions 
between subprograms. 

d) (a) and (b) only. 

e) (a), (b) and (c). 


Program decomposition is ` 


a) the process of breaking a program into procedures and functions. 

b) the process of starting with the general and working towards the 
specific. 

c) "commenting out" debugging and test code in the final version of a 
program. 

d) testing a program subprogram by subprogram. 

e) None of the above. : 


When a variable is used only by one subprogram, or is used by different 
subprograms but does not transfer any useful values between them: 


a) the variable should be passed as a value parameter. 

b) the variable should be declared locally. 

c) the variable should be re-declared as a global constant. 
d) the variable should be declared globally. 

e) the variable should be passed as a variable parameter. 


A long parameter list to a subprogram indicates that: 


a) the subprogram should be written as a procedure. 

b) the subprogram should be written as a function. 

c) the subprogram may be trying to do too much and should be rewritten 
as several different subprograms. 

d) the subprogram is violating the rules of information hiding and 
should be using records in the parameter list. 

e) the subprogram is a high-level procedure or function. 


A "module" consists of 


a) the declaration of an abstract data type and all of its associated low- 
level subprograms. 

b) a group of include files. 

c) the CONST and TYPE declaration blocks in a program. 

d) a group of subprograms which perform a well-defined function. 

e) a group of programs which access the same data file or files. 


A "postcondition” is 

a) a listing of the values produced by a subprogram. 

b) an assertion that describes what a subprogram has accomplished. 

c) a comment which describes the actions of the main program. 

d) a condition which indicates whether a subprogram has succeeded or 
failed ; 

e) an assertion which describes the actions performed by a loop. 
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18. Which of the following is a proper assertion for the actions performed by 
the code shown below? . 


SEx > y 
then Y = X; 


DEX S Y *) 

DEN x *) 

c) (4% > ¥ and: ¥ = 2-3) 

d) (* Y is the larger of X, Y *) 
e) (* X is the larger of X, Y *) 


19. If the code to test the postcondition of a subprogram is not significantly 
simpler than the subprogram itself, 


a) the testing code should be rewritten as a separate procedure. 

b) the subprogram should print an error message and halt execution of 
the entire program. : 

c) the testing code may contain bugs and should be removed. 

d) the subprogram is trying to do too much and should be rewritten. 

e) the testing code should be a visual check. 


20. Which of the following is a proper assertion for the actions performed by 
the code shown below? 


while not eoln(InFile) and (Pos < MaxLine) do 
“begin 
Line[Pos] := Ch; 
Pos := Pos + 1 
end; (* while *) 


a) (* InFile is at end of line *) 

b) (* InFile is at end of file *) 

c) (* InFile is at end of line or Pos >= MaxLine *) 
d) (* Pos <= MaxLine and InFile is at end of line *) 
e) (* InFile is at end of line or Pos > MaxLine *) 


PART II Answer the following questions based on the information given below, 

keeping the rules of information hiding, program decomposition and abstract data 

types in mind: ; 
Students at The Lawrenceville School attend eight class periods a day. These 
classes are chosen from the departments of Math, English, Science, 
Humanities, Language, Fine Arts, and Physical Education. In addition, each 
student has at least one free period from the possible eight. Each semester 
students receive a schedule of their classes showing the period number, 
department, teacher’s name and room number for each class. 


a) Write a complete type definition to store the schedules for the entire incoming 
freshman class (100 students) for the upcoming semester. 


b) Write procedure or function headers for two low-level subprograms which 
manipulate the types written above. Include pre- and postconditions. 


c) Write a header for a high-level subprogram which uses this type. Include a pre- 
and postcondition. cae 


PART III Write the bodies of the. subprograms described in parts (b) and (c) 
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MULTIPLE CHOICE 


9-1. 


9-3. 


9-4, 


9-5. 


9-6. 


Why is a program with good style important? 

a. The compiler will be able to work more efficiently. 

b. Compiling time will be much shorter. 

c. Itis easier for people to work with and understand. 

d. It will run correctly the first time it is submitted to the computer. 


All of the following are characteristics of a program with good style, EXCEPT 


a. Itis well documented. 
b. Itcontains only functions and procedures. 
c. Indentation, blank spaces, and blank lines are used to make the program easy to 


read. 
d. The identifiers in the program are given descriptive names. 


Documentation can be divided into two categories: and 





a. good, bad c. text, software 
. b. internal, text : : d. internal, external 
Documentation used in reference or user's manuals is known as 
documentation. 
a. external C- Seat 
b. final d. software - 
Comments that are placed within a program itself is called documentation, 
a. intemal C- xt ; 
b. final d. software — 


Internal documentation is used 

a. in reference or user's manuals : 

b. at the begining and end of a program or subprogram 

c. only within the body of a program or subprogram 

d. at the beginnning of and within the body of a program or subprogram 
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9-10. 


9-11. 


9-12. 


9-13. 


` How is a real number stored in memory? It is stored 


INSTRUCTOR'S MANUAL 467 


Documentation that follows program, procedure, or function headings should include 


a. title, purpose, method, usage, calling sequence, variables, and files used 
b. title, purpose, variables, types, constants, and arrays 

c. purpose, calling sequence, variables, types, constants, and files used 

d. title, method, calling sequence, variables and arrays 


Which of the following should be accompanied with comments to explain its purpose? 
a. standard identifiers c. control statements 
b. reserved identifiers d. standard data types 


Which of the following statements makes the best use of blank spaces and indentation so 
that it is easier to read and understand? 
a. Average:= (Num1+Num2+Num3)/3; 
b. Writeln(‘The Average of, Num1:4,Num2:4,Num3:4, 'is', Average:4); 
c. fori:=1to5 do 
for j := 1 to 10 do 


gin 
writeln (i*j); 
end; 
d. while x <> 3 do 
` begin 
write (x); 
x:=X+l; 
end; 


In which of the following cases would it be good programming style to use blank lines? 
a. around control structures; that is, before and after a loop 

b. after the identifier 'begin' 

c. before the identifier ‘end’ 

d. before and after writeln statements 





a. asa single binary number 

b. in two parts, a mantissa and an exponent 

c. in two parts, an integer number and an exponent 
d. the same way an integer number is stored 


What is the drawback of real-number storage? 

a. The outputted real number is difficult to read and understand. 

b. The outputted real number contains too many digits. 

c. A decimal number may not have an exact binary equivalent, and rounding errors may 
occur in converting to exponential notation. 

d. The computer can only store ten digits in the mantissa. 


The inaccuracy of the representation of real numbers is called __ error. 
a. representational c. canceilation 
b. numerical d. precision 


In the following case, what error has occurred? 

45.00 + .00000000000234 = 45.000000000 

a. representational c. cancellation 
b, numerical d. precision 
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9-15. 


9-16. 


9-17. 


9-19. 


9-20. 


9-21. 


9-22. 


9-23. 


9-24. 


TURBO PASUAL FRUGHKANIMNING PYVUAT 


In TURBO Pascal, the range of real numbers is from 1E - 38 through 1E + 38. Which 
of the following numbers would cause a numerical underflow? 

a. 2.35E + 39 c. 4.56E -37 

b. 1.23E-01 d. 2.34E - 39 


A numerical overflow occurs when : 

a. anumber smaller than the smallest number the TURBO system can represent is used 
b. anumber larger than the largest number the TURBO system can represent is used 

c. alarger number cancels out the smaller one aay 

d. too many numbers are being stored in memory 


The three types of program errors are errors. 
a. syntax, semantics, and logic c. syntax, logic, and run-time 
b. semantics, logic, and run-time d. syntax, typographical, and run-tirne 


Which of the following statements about syntax errors is false? 

a. A syntax error will cause a program to stop executing prematurely. 

b. Syntax errors are also called compile-time errors. 

c. Syntax errors are violations of the grammatical rules of a language. 

d. Syntax errors are discovered when an attempt is made to compile the program. 


What error causes abnormal behavior during execution, causing the program to crash? 
a. syntax ` c. logic. 
b. semantics . d. run-time 


Programs that will compile, execute, and output results, but whose output will 


sometimes be incorrect, contain errors. 

a. syntax c. logic 

b. semantics d. run-time 

What kind of error occurs in the following expression : Average := A+ B/2,; 
ʻa. logic f c. rùn-time 

b. syntax : d. semantics 


Assume a while/do loop needs to be executed 50 times, and the following expression is 

the controlling condition: while x < 50 do. Will an error occur? If yes, what kind of 

error is it? 

a. Given the condition above, the loop will execute 50 times; there is no error. 

b. The expression contains a syntax error; there should be a semicolon (;) after the 
expression. 

c. An off-by-one error will occur; the loop will execute only 49 times. 

d. A run-time error will occur; the loop will execute only 49 times. 


The two methods of testing programs for errors can be divided into two categories : 

a. “text testing and run-time testing © c. desk checking and static checking 
b. static testing and run-time testing d. desk checking and run-time testing 
Which form of testing involves carefully examining the text of the program for errors? 
a. static c. text 

b. run-time d. trial and error 


ee en a 
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Sometimes the compiler will indicate a syntax error on a particular line, but when it is 

examined, no error can be found. What has probably happened? 

a. The compiler has made a mistake. 

b. The compiler has detected a logic error. 

c. An earlier syntax error has caused the compiler to flag later statements that are 
correct. 

d. The compiler'has detected a run-time error. 


A good strategy for locating run-time errors is to use ; 

a. writeln statements at locations in the program where there might be a problem 
b. readin statements at locations in the program where there might be a problem 
c. control statements after each procedure 

d. the desk checking testing method 


Use the following program to answer questions 27-29. 


Assume the task of the following program is to print the numbers from 2 to 5, but that there is an 
error in the program. 


—_O* 


Ps 
ON 
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9-28. 


9-29. 


9-30. 


program PrintNumbers (input, output); 
var 
i: integer; 


begin (* PrintNumbers *) 


i:=1; 
while (i >= 1) or (i < 5) do 
begin (* while *) 
i:=i+1; 
writeln (i); 
end; (* while *) 


~ end. (* PrintNumbers *) 


In which line does the error occur? 


ac? c. 8 

b. 6 T d. 9 

What kind of error is it? 

a. syntax c. logic 

b. run-time d. semantics 


What will the error cause the program to do? 

‘a. The program will not be executed. 

b. The while/do loop will not be executed. 

c. The program will terminate execution prematurely. 
d. The program will go into an infinite loop. 


All of the following are common run-time errors, EXCEPT 
a. assigning a value of the wrong type to a variable 

b. division by zero 

c. a missing semicolon after a statement 

d. incorrect passing of parameters to subprograms 





9-37. 


9-38. 


9-39. 


9-40. 
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What is the effect of the run-time error? 

a. The Average computed is not the true average; Given the first input, the result printed 
will be 35 when the answer should be 15. 

b. The program will attempt to assign a real number to the integer variable Average. 

c. It will cause the program to go into an infinite loop. 

d. The program will not be executed.’ 


Locate the logic error in the program. What line is it in? 
a Cm 
b. 4 d. 6 


What is the effect of the logic error? 

a. The average computed is not the true average; given the first input, the result printed 
will be 35 when the answer should be 15. 

b. There is an incompatability of type; the second line of input will result in a value that 
is not of the same type as the variable ‘Average’. 

c. It will cause the program to go into an infinite loop. 

d. The program will not be executed. 


Why is it useful to write a program that can handle a wide variety of invalid user input 

properly? 

a. If ~ user types in an invalid input data, the program will be able to handle it and not 
cras 

. The user will be able to type in as much input data as he wants. 

. Syntax errors in the program itself will be eliminated. 

The program will not go into an infinite loop. 


aoe 


Locate the syntax errors in the following program. 


OOO AU RO 
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program CoinToss (input, output); 
const NumberOfTimes = 100; 
~ var i = integer; 
Sum, Power : real 
begin (* CoinToss *) . ~~- 
Sum = 0; 
Power := 1; 
for i:= 1 to NumberOfTerms do 
begin (* for *) 
Power := Power * 2 
Sum := Sum + i/ Power; 
end. (* for *) 
end. (* CoinToss *) 


How many syntax errors are there in the program above? 
a. 


c. 
b. 4 d. 6 

In which lines did they occur? 

a. 3,4, 11, 12 C.: 20,9, %,.14,.12 
b. 4,6,7 d. 3, 4, 6, 10, 12 
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9-31. Once a program appears to be working correctly, it should be 

a. considered finished and error-free 

b. tested once to determine if it obtains the correct results 

c. tested with a wide variety of data to determine if it obtains the correct results 
d. checked for syntax errors 


9-32. Once a program appears to be working correctly, one method of selective testing is to 
make certain that the program works properly with : 
a. boundary cases c. control statements 
b. case statements d. writeln statements 





9-33. What is the boundary case of the following expression? 
If Measurement > 12 | 
then writeln (‘His feet are very large.’) 
else writeln (‘His feet are not large.'); 
a. 10 e. -f2 
bahi A i dr 


9-34. What is wrong with the expression in question 33? 

a. If the value 12 was assigned to ‘Measurement’, no mesage would be printed. 
b. If the value 11 was assigned to ‘Measurement’, no message wouid be printed. 
c. There is a semicolon missing after the 'then writeln’ statement. 

d. There is nothing wrong with the expression. ae 


9-35. In the world of programming,.the term user-friendly refers to a 

a. user who is friendly 

b. program that is written in a manner that makes it easy for people to use 
c. program that can talk 

d. intelligent robot 


Use the following program to answer questions 36-39. 


à Program ComputeAverage (input, output); 3 

var ~ 
3 Num1, Num2, Num3, Average : integer; 

4. begin (* ComputeAverage *) 

5. Readln (Num1, Num2, Num3); 

6 Average := Numi + Num2 + Num3 / 3; 

7 writeln (Average); 

8 end. (* ComputeAverage *) 


Input 
13-17 45 
10 15 10 


9-36. Locate the run-time error in the program above. Which line contains the error? 
a. 3 Cc. 
b. 4 ; d. 6 





Chapter 6 


More on Modularizing Programs 


MULTIPLE CHOICE 


6-1. A function is a type of : 


a. procedure c. arithmetical expression 
b. subprogram d. datatype 
6-2. Which of the following is a standard function? 
a. sqrt c. div 
b. writeln : d. xor 


6-3. Which of the following statements about functions and procedures is true? 

a. A function is a type of procedure. 

b. Functions have a wider application than procedures; procedures are usually used to 
compute a single value to be retumed to the calling program, whereas functions can 
return more than one value and are often used to perform other tasks. 

c. There are only standard functions, whereas procedures can be either standard or 
user-defined, eae 

d. A function name represents the returned value to the calling program, while a 
procedure name-does not represent a value. : 


6-4. A function is executed by using a function ; 
a. call c. declaration 


b. name d. parameter 


6-5. | Which of the following is a valid function heading? 
a. function Cube (x: integer); 
b. function Cube (x:= integer): integer; 
c. function Cube (x : integer) : integer; 
d. function Cube (x: integer: integer); 





Questions 6-9 refer to the following program statement: 
N := round (23.456); 


6-6. The statement above is a 


a. procedure call c. type declaration 
b. Boolean expression d. function call 
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6-7. 
6-8. 
6-9. 


6-10. 


6-11. 
6-12. 
6-13. 


6-14. 


‘round' is a : 
a. procedure name 
b. Boolean operator 


23.456 is the 
a. actual parameter 
b. formal parameter 


Variable 'N' must be of type 
a. real 
b. integer 


xX i= (35.89); 


ap 


ap 


c. char 
d. 
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function name 


. arithmetic operator 


value retumed 


. constant value 


boolean 


Which function would take the argument 35.89 and return a value of 35? 


a. round 
b. ord 


Evaluate each of the following expressions: 


chr(5) where ord(‘A') = 1 
1. 

b D 
trunc(abs(-14.6)) 
a. 14 

b. -14 
frac(1.345) 

a: honk 

Bae os 
succ(succ('E')) 
rep 

b. ‘E* 
pred(succ('E’)) 
a. D- 

b. R! 


Cc. 
d. 


impo epo Pp 


Ap 


C: 
d. 


trunc - 
chr 


15 
14.6 


Pe 
345 


'F’ 
'g' 


'F' 
AG 


Which of the following statements about ordinal types is false? 


a. The data type real is not ordinal; the real number does not have a predecessor or 


Successor. 


mo 
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The order of the Boolean values is true, false; that is, the successor of true is false. 
The functions succ, pred, ord and chr are only used with ordinal data types. 
The ordinal values can be thought of as a list of values that has a beginning and end. 
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Use the following program segment to answer questions 17-19. 
| 


program Trace (input, output); 
type fruit = (apple, banana, orange, grape, pear); 
var A,B: fruit; 
T: integer; 
begin 
A := apple; - 
B := orange; 
while A <> Bdo. ~> 
begin 
A := succ(A); 
T :=succ(T); 
end; 
end. 


After execution of the above program: 


6-17. 


6-18. 


6-19. 


6-20. 


The value of T is : 

a. ae 
“he d. 3 

The value of A is 

a. apple c. banana 
b. orange d. pear 
The value of ord(B) is ae 

a. 2 : el 

b. orange i - d. 3 

‘All of the following are string functions in TURBO Pascal EXCEPT i 
a. length c. pos 
b. substring d. concat 


Questions 21-23 refer to the following expressions: 


6-21. 


String] := 'Mary'; 
String2 := ‘Jane’; oe 
String3 := ‘Doe’; . 


String4 := concat (String2, String], String3); 


What is the value of String4? ; 
a. 'MaryJaneDoe' c. ‘JaneMaryDoe' 
b. 'Mary"Jane"Doe' d. 'DoeMaryJane' 


6-22. Assume x := length(string4); What is the value of x? 
1 


a i3 c. 30 
b. H—-- d. string1 + string2 + string3 
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6-23. 


6-24. 


6-25. 


6-26. 


6-27. 
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Assume y := copy(string4, 6, 6); what is the value of y? : 
a. ‘Mary Doe’ : c. ‘Jane’ 
b. ‘aryDoe' d. ‘Jane Doe’ 


User-defined functions are declared ; 

a. in the body of the main program 

b. after the procedure declarations 

c. before the procedure declarations 

d. after the constant, type and variable declarations 


Which of the following is a valid function call for this function heading? 
function Grade (TestScore1, TestScore2: real; lettergradel, lettergrade2: char) char; 
a. Letter := Grade (98.5, 80, 85.6, 'A', 'B'); : 
b. writeln (Grade (80.0, 95, 'B', 'A’)); 
c. Grade (88, 90, 'A', 'A', 'A'); 
d. Grade := (78.5, 90, 'C', 'A'); 


The process which enables a programmer to keep changes made to certain parameters 
from being returned to the calling program is termed 


a. parameter call c. cal by Tocation 
b. variable call ; d. call by value 


Which of the following statements about scope blocks is true? 

a. A scope block consists only of procedures and functions. 

b. A scope block is another term for the body of a program or subprogram. 

c. A scope block refers to the portion of a program in which a particular identifier is 
defined. i 

d. The scope block consists of a parameter list and a local declaration section. 
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Questions 28-30 refer to the program outline below: 


program Main (input, output); 
const X =... 
var... 
procedure A... 
const y =... 
var... 
procedure B... 
const z... 
var... 
begin (* B *) 


end; (* B *) 
begin (* A *) 


end; (* A *) 

procedure C... 
const W =... 
var... 

begin (* C *) 


end; (*C*) ge 3 
begin (* Main *) By 


end. œ Main *) 


6-28. Inthe program above, there are scope blocks. 


ae d C; 
b 3 ' f 


6-29. Which of the following is a global constant? 

a. x CEZ 

D y ; d. w 
6-30. The scope of y ranges from _ ; 

a. line ‘Procedure A' to ‘end; (* A *)' 

b. line 'Procedure B' to ‘end; (* B *)' 

c. line ‘Procedure A' to ‘end; (* C *)' 

d. the beginning to the end of the entire program . 


6-31. Stubs are : 
a. very short programs c. empty procedures 
b. outlines of the main programs d. subprograms of subprograms 


6-32. Why are stubs useful in structuring programs? 
a. They are used to test data before it is inputted. 
b. They allow the programmer to test each procedure or function as it is being added on 
to the program. 
c. Using stubs, there are fewer procedures and functions to write. 
d. They are used to test statements in the body of the main program for syntax errors. 


ae ee eee oe 





6-33. 


6-34 


6-35. 
6-36. 


6-37. 


6-38. 
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Passing parameters via value parameters protects the program from 
a. invalid parameter lists c. logic errors . 
b. syntax errors d. side effects 


What is meant by call by location ? 

a. The procedure is given the address of that variable in the calling program. 

b. The value of the variable is passed to the procedure. 

c. A separate memory location containing a copy of the value of the actual parameter is 
used. 

d. The same variable is used. 


Questions 35-38 refer to the following program: 


program sample (input, output); 
var A, B, C: integer; 
procedure exercise (var x, y: integer); 
var C: integer; 
begin (* exercise *) 
xt= leyt=2; C:=3; 
writeln(x, y, C); 
end; (* exercise *) 
begin (* program *) 
A:= 4; B :=5; C := 6; 
exercise(A, B); 
writeln(A, B, C); 
end. (* program *) 


a. 456 c. 453 

b. 126 ; d. 123 

What is the output of the second writeln (‘writeln(A, B, C)')? 
a ASS CASS 

b- T20 Fj di- 323 


What will be the output of the program if the procedure heading is changed to the 
following? (The 'var' is left out) 
procedure exercise (x, y: integer); 


a: Feo e 123 
126 456 
b. 456 di- E26 
453 456 


What will be the output of the program if the procedure call is changed to: 
exercise (A, A)? 
a, 223 pi 


256 
b- F23 d. 
456 


ANN 
ANNUK 
Aww 
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Questions 39-41 will refer to the following program: 


program ChangeNumbers (input, output); 
var Temp, A, B: integers; 
procedure Flip (var x, y: integer); 
var Temp: integer; 
begin (* Flip *) 
Temp := x; 
key, 
y := Temp; 
end; (* Flip *) 
begin (* ChangeNumbers *) 
A:= 2; B := 3; Temp :=5; 
Flip (A, B); 
writeln (A, B, Temp); 
end. (* ChangeNumbers *) 


6-39. -The Ae of Ais 











a. e 3 

PARE) 4. 6 
6-40. The value of Bi 

a 2 : OIER 

ba d. 6 
6-41. The value of Temp is 

a2 e253 

b. 3 d. 6 
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~ An AP Computer Science course emphasizes programming methodology, algo- 
rithms, and data structures. It utilizes a programming language to illustrate and 
implement these three areas. The language Pascal was chosen as the A.P. language 
because it is well suited to the objective of writing logically structured, well-docu- 
mented computer’ programs. Its high degree of modularity facilitates structured 
programming. z ; ; 
; Pascal features a number of general characteristics which are “desirable in a 
programming language: 

(a) It has relatively few. concepts and has simple and regular rules for combin-. 

ing them. A programmer can readily use these concepts as primitives in develop- 
ing algorithms. ; 


(b) Pascal enables programmers to write readable programs following good 
program design techniques. Algorithms can be directly translated into Pascal and 
the program structure will reflect the underlying logical structure of the algorithm. 


(c) The two previous characteristics simplify the process of program verifica- 
tion. Verifying that a program, or a portion of one, is correct (that it does what it is 
intended to do) not only can guarantee good results but also can reduce costly 
program maintenance when the program is used over a period of time. 


(d) When we restrict our attention to standard Pascal (rather than to any of z 
the versions, such as UCSD Pascal, which have supplemental features) we find 
that the language is well documented, reliable and reasonably efficient. Pascal 
syntax can be specified succinctly using syntax diagrams or BNF (Backus-Naur 
Form) grammar notation. 


(e) Programs written in standard Pascal can be run on practically every computer 
for which a Pascal compiler exists. This transportability is also a desirable feature. 


Several additional features of Pascal result from design goals. These include 
block structuring, data-type treatment and sequence-control structuring. 
Pascal is a block structured language. Its main program is a block which ` 
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contains subprograms (also called blocks). The structure of each block includes, in 
order, a header with specifications of parameters and results, a declaration section 
(labels, constants and type definitions, and variable identification), subprogram 
(functions and procedures) definitions, and an executable part. 

Pascal provides types such as integer, real, character, enumerated, Boolean, 
array, record, sequential file, and set. Also, pointer type and an operation to create 
new data objects of any type allow new linked data objects to be formed dynami- 
cally, during program execution. A programmer can define a data type and then, 
separately, declare variables to be of that type merely by giving the name of the 
new type. Definitions do not have to be repeated. : 

Although programmers infrequently need to use them, Pascal allows labels 
and GOTO statements to control the sequencing of statements in a program. 
Instead, structured control statements are utilized, such as compound statements, 
three kinds of iteration statements, and conditional and case statements. This 
facilitates the implementation of structured programming concepts. 


1. Program 
A program written in Pascal very simply consists of a program heading fol- 


lowed by a block followed by a period. 


PROGRAM ` 
= program heading ——_—— block———_—_. 


Program heading and block can be specified further as follows’: ` 


‘ PROGRAM HEADING ` 








———— PROGRAM identifier———> |—— >identifiers—>} —— 
IDENTIFIER 


——> letter ——___—__——> 
letter x 
digit 


LABEL ———> unsigned integer ————>;; 


BLOCK 










CONST —> identifier — = —> constant —>; y 


IPES > identifier —> = ——> type —i) 


VAR wi identifier [ie ee 


block ———— j 
Forward 













i 


PROCEDURE — identifier — parameter list 
FUNCTION —— identifier —> parameter list —> : —>type 
identifier 


E 
BEGIN a aang gs — END ———— 


+ . 


Syntax diagrams are collected in Appendix B. 


a 
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2. Block 


As indicated in the syntax diagram of block, both procedures and functions in 
Pascal consist of blocks. Identifiers defined in procedure or function blocks are 
called local variables; those defined in the program are called global variables. An 
identifier’s scope, or true range of meaning, is the block in which it is defined (and 
all blocks nested in that block). A local variable (type, value, subroutine) takes 
precedence over a global variable (type, value, subroutine}. 

The program block is activated when the program is run; procedure and func- 
tion blocks are activated when called. Local and global variables are allocated, but 
left undefined, when a block is activated. They are considered deallocated when 
the block’s activation is terminated, that is, when its statement part has been 
executed. Values assigned to variables exist only while the blocks in which the 
variables are defined are activated. 


Identifiers declared in are defined only in 








Program A 
Procedure B Program A A,B,C,D,E,F,G, 
Procedure C Procedure B B,C,D 
Procedure C C.D 
’ Procedure D LD 
Procedure E . Procedure E ` E,F,G 


Procedure F A 
Procedure G G 


Variable x declared in program A is defined 
everywhere except procedures B, C, and E. 
Variables y and z in program A are defined 
Var x,y,z everywhere. 

Variable v in procedure B is defined only in 
procedure B, but not in procedure C. 





r 


Program A 


Procedure B Variable x in procedure B is defined in proce- | 
Var v,x dure B and C. It is different from the x in 
program A. 
Procedure C Variables u and v in procedure C are defined: 
Var u,v in procedure C only. The variable v is differ- 


ent from the v in procedure B. 
Variable s in procedure D is defined in proce- 
dures D,E, and F. 
Variable t in procedure D is defined in proce- 
Var s,t dure D and E but not in procedure F. 
Procedure E Variable x in procedure E is defined only in 
Var x procedure E. It is different from the x in proce- 
- dure B and the x in program A. 
Variables t and u in procedure F are defined 
only in procedure F. The t in procedure F dif- 
fers from the t in procedure D and the u in F 
differs from the u in procedure C. 


Procedure D 


Procedure F 
Var t,u 





. - 
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3. Identifiers 


An identifier consists of one or more letters and digits, the first of which must 


be a letter. For clarity and readability, an identifier should convey an indication of 
its use. For example, Payrate or Month are better choices of identifiers for, respec- 
tively, an employee’s hourly pay rate and the month (e.g., January, February, ..., 
December} than P or M, even though the latter two identifiers are valid. Well- 
chosen identifiers can help document a program. 


4, Data Types 


` A data type defines the set of values a variable may assume. Pascal has four 
predefined or primitive data types: integer, real, character, and Boolean. 

a. Data objects of type integer are positive or negative whole numbers (e.g., 43, 
—695). A predefined identifer maxint has a value set to the largest integer that can 
be represented in a specific computer. Arithmetic operators provided for integers 
are + , — , *, div, and mod. All arithmetic operators, including div—which returns 
the integer value of the quotient (10 div 4 = 2}—and mod—which returns the 
remainder (10 mod 3 = 1), are subject to the rules of arithmetic precedence (see 
Appendix C). Relational operators are = , < >, <,>, <=,and p 


b. Data objects of type real are decimal numbers (e.g., 3.5, —0.4663, 957.482). 
At least one digit must appear on each side of the decimal. Scientific notation is 
also legal. For example 21e4 and —3.22e—2 are the same values as 210000.0 and 
—0.0322, respectively. Arithmetic operators provided for reals are +, —, *, /. div 
and mod can not be used with reals. There is no exponentiation operator. Rela- 
tional operators are = , <>,<,>,<*, and >=. Integer and real operands can be 


mixed to produce a real result. 


c. Data objects of the type char are each one character from the set of available 
characters (including upper and lower case letters, punctuation marks, digits, and 
other special symbols) in a specific computer implementation (e.g, ‘E’, ‘*’, ‘5’). 
Each implementation orders the character set so that relational operators =,< >, 
<, >, <=, and >= are available. 


d. Data objects of the type boolean have values true and false. Boolean data 
cannot be read in as data, but can be printed. The operators not, or, and and take 
the Boolean operands. Also, relational operators =, < >, <, > , <>, and >= may 
have Boolean operands. 

These four data types are scalar data types because the set of values in each is 
ordered. The types integer, char, and boolean can also be referred to as ordinal data 
types because each value has a unique predecessor (except the first) and unique 
successor (except the last]. pred and succ are predeclared functions which will 
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return these values. For example, pred (’D’) returns ‘C’ and succ (’D’} returns ‘F’. 
real is not an ordinal type because unique predecessors and successors cannot be 
specified for real numbers. 


e. Additional ordinal types, called enumerated types, can be defined by pro- 
grammers. To do so, the constants which make up the type must be listed. The 
listing gives the ordering of constants so that pred and succ can be used. 


EXAMPLES 


var 
- Meat: (Beef, Pork, Liver, Chicken); 
Cheese: (Edam, Brie, Cheddar) 


Each element within parentheses is a constant, not a variable, of an enumer- 
ated type. An assignment 
Meat := Liver; 
is valid. The use of ‘Liver’ would be invalid because Liver is a constant of type 
Meat, not a character string. Edam, Brie, and Cheddar may be used, for example, 
as elements in a case statement.’ Also, pred|Brie) returns Edam and succ| Brie) 


returns Cheddar. 
These user-defined types have meaning only within a program. They cannot 


be read in or written out directly. 


f. Pascal also allows definitions of subrange data types. These types enable a 
programmer to specify a portion of an already existing type.’ 


EXAMPLES 
subrange type values included 
var Posint : 1..10; z 1,2,3,4,5,6,7,8,9,10 
Grades : AE; ; AER OB EE 
Bigmeat : Beef. . Liver; Beef,Pork, Liver 


Use of subrange types can improve program readability. Also, automatic range 
checking is implemented; assignment of a value outside the defined subrange, for 
example, Grade := G, will cause a runtime error message. 

Subrange and enumerated type declarations ċannot be used in formal parame- 
ter lists of procedures and functions. However, subrange and enumerated types can 
be passed as parameters provided a type definition is first declared and then the 
variable is defined to be of that type. 


EXAMPLE 


type 
Cheese = (Edam, Brie, Cheddar); 
Decimal = 1. .10; 

var 
DairyProd : Cheese; 
Posint : Decimal; 
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5. Variables 


A variable identifies a memory location in which a value can be’stored. Pascal 
variables must have a name, a type, and a value. Any legal identifier can be a 
variable name. The name and type are declared in the declaration part of the 
program. A value must be assigned in the program; there is no default option. 


EXAMPLE 


var 
Grade, Number : integer; 
Mean, Median : real; 
LetterGrade : char; 
ListEnd : boolean; 


Assuming these declarations, the following assignments are legal: 


Grade := 98; 
Mean := 72.5; 
LetterGrade := ‘A’; 
ListEnd := false; 


6. Constants 


r 


A constant may be a number (integer or real), a character, or a string of 
characters. Constants used in expressions (e.g. 5, —16.3, ‘R’, ‘HELLO’) are called 
literal constants. Those specified in the declaration section of a program are called 
named constants. Examples of the latter include: 


const Blank = ‘'; 
Minimum = 10; . 
Border = Pd ease ee a 
Root = 3.333; 


The value on the right hand side of the definition must be a literal constant. 
Named constants may be used in-expressions. For example, Max := Minimum + 
40; is legal. Declaring constants facilitates program readability and changes. To 
change Minimum to 5 throughout a program, only the constant definition would 
need to be changed. 


7. Assignments and Expressions 


Expressions consist of variables, constants, and operators. They appear only on 
the righthand side of an assignment statement. For example, consider the expres- 
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sion Max := Minimum + 6 * (I + 5); is a legal assignment statement. Both sides of 
| must be of the same type (an exception: the right- 


the assignment operator | := 


hand side may be integer whe 


a single variable or a function identifier. 


n the lefthand side is real). The lefthand side must be 


8. Input/Output 


Four required procedures enable 


operating environment. read 


gets values 


communication between a program and its 


for variables in its argument list. write 


causes variable values to be printed. readin gets values similar to read but also 
discards the entire line of input: writeln performs a write but also causes the next 
output to be printed on the next line. ; 


‘ 


EXAMPLE 1 
. Statements Results and pointer position 
write (‘AP ’); AP 
; T 
write (COMPUTER '); AP COMPUTER 
write (‘SCIENCE ’); AP COMPUTER SCIENCE 
: ' 
writeln; AP COMPUTER SCIENCE 
t 
EXAMPLE 2 


writeln (‘AP COMPUTER. SCIENCE’) ’ AP COMPUTER SCIENCE 


Assume three lines of data each ending with an end-of-line character 


11 22 33 <eoln> 
44 55 66 <eoln> 
77 88 99 <eoln> 


Assume P, Q, R, and S have been declared integers 


EXAMPLE 3 


Statement 


read|P,Q); 


- read{R,S); 





Values read 


P:11 Q:22 


R:33 S:44 


Pointer position 


11 22 33 <eoln> 
tT 
44 55 66 <eoln> 
t 
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EXAMPLE 4 

readIn{P); 3 Pill. | 44.55 66 <eoln> 
read(Q); Q:44 ye 55 66 <eoln> 
readIn{R); R:55 me 88 99 <eoln> 
read(S); S77 : ye 88 99 <eoln> 

T 
Assume the same data but P,Q are declared characters and R,S are integers. 

EXAMPLE 5 

read(P,Q); PY OY 11 22 33 <eoln> 
read{R,§); R:22 $:33 k 22 33 <eoln> 

: T. 

EXAMPLE 6 

readln(P}; ; Pa" - : 44 55 66 <eoln> 
read(Q); 7 E. l a 55 66 <eoln> 
readIn(R); R:4 © 3 k 88 99 <eoln> 
read(S), $:77 | i = 88 99 <eoln> 


A 


9. Begin—End 


The begin-end pair delimit a compound statement. This enables a programmer 
to include a sequence of statements in a control structure or in the executable 
statement section of the program. Because begin-end delimits one statement, a 
compound one, the pair must be used whenever more than one single statement is - 
to be executed as part of the structure of a block. 





CHAPTER 3 


[ 
l 
i 
l 
freatures of the Programming 


ILanguage (II) 


b 


10. Control Structures 


Pascal implements three basic control structures: sequence, selection and 
repetition. 
; a. Sequence ; : 
© This structure involves a succession of statements-to be executed in the se- 
quence or order in which they appear. The begin-end pair is used to delimit the 
sequence, hence to make it one compound statement. : 


EXAMPLE 
begin Left := 0; 

Right := 2; 

Midpoint := (Left + Right) div 2; 
end; 


b. Selection 
This type of control structure provides a decision-making ability based on a 
condition being true or false. One structure is called an if-statement and has two 


forms: 
(i) if boolean expression (ii) if boolean expression 
then statement 1; then statement 1 
statement 2 else statement 2; 


statement 3 


In (i), when the boolean expression is true, then part (statement 1) is exe- 
cuted, followed by statement 2. When the Boolean expression is false, only 
statement 2 (not statement 1) is executed. In fii), when the Boolean expres- 
sion is true, the then part (statement 1) is executed followed by statement 3. 
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When the Boolean expression is false, the else part is executed followed by 
statement 3. Any of the statements may be compound statements. Struc- 
tures may be nested, i.e., part of a compound statement. 


EXAMPLES 

1) if Grade > 70 only when the value of 
then writeln (‘Pass’); Grade is greater than 

Grade := Grade + 10; 70 will ‘Pass’ be printed. 
In all cases the statement 
Grade := Grade +10 will 
be executed. 

2) if Grade > 70 ; When the value of Grade is 
then writeln (‘Pass’); greater than 70, execution 
else writeln (‘Fail’); will be as in 1). When 

Grade := Grade + 10; Grade is less than or 


equal to 70, ‘Fail’ will 

be printed and then 

Grade := Grade + 10 will 
be executed. 


A second selection contro] structure is called a case statement. This state- 
ment allows alternate choices to be executed at runtime. Each case statement has 
a corresponding nested if statement which has the same effect. 


EXAMPLE 


case Grade of 
100,90: writeln (‘High pass’); 
80 : writeln (‘Pass’); | 
70 : writeln (‘Marginal’); 
60,65 : writeln ee ) 

end; 


Assuming Grade can take E the values 50, 60, 70, 80, 90, | or 100, then the 
case statement has the same, effect as 


if (Grade = 100) or (Grade = 90) 
then writeln (‘High pass’) 
else if Grade = 80 
then writeln (‘Pass’) 
else if Grade = 70 
` then writeln (‘Marginal’) 
else if (Grade = 60) or (Grade = 50) 
then writeln (‘Fail’); 


A case statement is generally more readable than its corresponding if state- 
ment. One caution when using case: if the case selection or scalar variable (Grade 
in the example} can take values not in the case label list (50, 60, 70, 80, 90, or 100 
in the example}, then the result is dependent on a specific Pascal implementation. 
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Syntactically, the case label list values must be of the same type as the case 
selector or scalar variable. Type real cannot be used. Also, the action to be per- 
formed (writeln in the example) can be a simple or compound statement. 


3 ‘ 
c. Repetition 
This type of control structure provides the capability of repeating the execu- 
tion of a statement. There are three forms of the structure. 


(i) while Boolean expression do {*while statement“) 
statement 1; 

statement 2; 

(ii) for variable := initial value to final value (*for statement*) , 
do statement 1 

statement 2 

(iii) repeat (*repeat statement") 
statement 1 


statement n 
until Boolean expression 


In the while statement, statement 1 (a simple or compound statement) is 
executed as long as the Boolean expression is true. When it becomes false, then 
statement 1 is omitted and statement 2 is executed. To prevent an infinite loop 
from occurring, statement 1 must include a means of changing the one or more 


values in the Boolean expression so that it can become false. The Boolean expres- 


sion is tested before the loop is entered, i.e., before statement 1 is executed. 

In the for statement, statement 1 is executed an exact number of times, 
namely (final value — initial value + 1). Only then will statement 2 be executed. 
Both initial value and final value can be expressions. When initial value < final 
value, the reserved word to is used. When initial value > final value, then downto 
is used. The control variable must be declared in the variable declaration part of 
the program and must be ordinal type (and therefore cannot be real). It is undefined 
after normal termination of the for statement. 

In the repeat statement, all statements are executed at least once. The Boolean 
expression is tested after execution of the statements. The statements are then 
repeated until the expression becomes false. The begin-end pair may be used as 
delimiters for statement 1... statement n but are not necessary; repeat-until act 
as delimiters. If the Boolean expression can never become false, then an infinite 
loop will result. 

All three programs achieve the same result for n >= 1. For example, if n = 5, 
then the sum of the first five integers or 15 is output. Because the repeat statement 
tests after the loop, both assignment statements will be executed one if n = 0 
initially. If n < 0 initially, neither the while nor the for statements will be exe- 


cuted but the repeat statement will cause an infinite loop. 
t 
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EXAMPLE 


program ExRepeat; 
var h,n:integer; 


program ExFor; 
var i,h,n:integer; 


program ExWhile; 
var h,n:integer; 














begin 

read (n); 
write (n); 
h:= 0; 















while n > 0 do for i := n downto 1 do repeat 

begin h:=h+n; h:=h+n; 
h:=h+n,; et pk 
n:=n-1 until n = 0; 

end; 





writeln įh) 
end. 


PN 11. PROCEDURES 


Procedures are subprograms which implement the block structure nature of ` 
Pascal. / 


—— > PROCEDURE ————> identifier --——————>; ——> block —>; 


parameter list 


Comparing these representations to corresponding ones for program indicates 
the basic difference is that a procedure begins with a procedure heading andends with 
a semi-colon, while a program begins with a program heading and ends with a period. 
Within the heading section the differences are (i) procedure'rather than program and 
(ii) one or more formal parameter sections rather than one or more identifiers signify- 
ing external files. The block section is the same for both except that variables declared 
within a procedure are local to that procedure {and any procedure or function called 
from the procedure) and are not defined outside the procedure. 

A procedure call in the main program or in another procedure must specify all 
items in the procedure heading. In particular it must specify parameters in the 
parameter list. Variable names used in calling a procedure are termed actual pa- 
rameters, those in the procedure heading are termed formal parameters. Actual 
and formal parameters must match in number, type and position within the pa- 
rameter list. When the procedure is called in a program, logical control is passed to 
the first statement in the first begin-end pair of the procedure. When execution of 
the procedure ends, control is returned to the statement immediately following 
the procedure call. oes, 

Procedure parameters are specified in two ways: by value and by reference. 
The latter are defined in a var declaration in the procedure heading and are called 
variable parameters; the former are not included in a var declaration. For each 
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variable parameter, the actual parameter establishes a memory location and the 
formal parameter references the same location. Both a value and an address can be 
passed. Thus a change in a value of a formal parameter also changes the actual 
parameter value. For each value parameter in the procedure heading, a new mem- 
ory location is maintained, different from that of the actual parameter. When the 
procedure is called, a copy of the value of the actual parameter is stored in the 
address of the formal parameter. Only the value, not the address, is passed. Any 
change in the formal parameter’s value will not change the actual parameter’s 
value. 

If communication between a procedure and any other part of the program is 
maintained through parameters, there is less chance of hard-to-detect errors called 
side effects. These occur when changes in values in one program module affect 
another program module without the values being passed as parameters. A global 
parameter directly referenced (not through a parameter list) within a procedure, 
although legal, can cause undesirable side effects elsewhere in the program. 


EXAMPLE 


program TwiceSum (input, output); 

var n, Total: integer; sa 

procedure AddUp (k: integer; var Sum: integer); 
var i: integer; 


begin . = 
k= 2*k; 
Sum := 0; 


fori := 1 tok 
do Sum := Sum + i; 

end; 

begin 
nes = à 
AddUp |n, Total); 
writeln (Total) 

end. 


The procedure AddUp is called with n = 5. The value parameter k also 
becomes 5. The variable i is a local variable defined only in the procedure. In the 
first executable statement of the procedure k gets the value 10 (but n does not 
change). The for statement is executed 10 times and the resulting value of Sum is 
1+2+3...+ 9 + 10 = 55. This value is passed to Total because Sum is a 
variable parameter. The output of the program is 55, the value of Total. After the 
procedure terminates, n = 5 but i and_k do not have values. 


12. Functions 


Pascal implements both built-in functions and user-defined functions. A user- 
defined function returns a value to the calling program. The function’s name is the 
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variable identifier which transmits the value. Because a function acquires a value, 
it must have a type. In all other aspects a user-defined function is similar to a 
procedure. Syntax diagram for a function is as follows: 


— » FUNCTION — identifier ——————— : type —> block —>; _ 


parameter list 


EXAMPLE 


The previous procedure could be written as a function. 
program TwiceSum (input, output); ° 
var n: integer; ; 
function Sum (k: integer) : integer; 
var i, TempSum: integer; 
begin 
Lk = Zek; 
‘TempSum := 0; 
for i := 1 to k 
do TempSum := TempSum + i; 
Sum := TempSum 


end;. 

_ begin ; 
n:= 5; l 
write (Sum{n)} 

end. a 


„Note: write (Sum(n}); could be replaced by Total := Sum|{n); 
write (Total) 


13. Recursion 


Recursion is one of the powerful features in Pascal. Recursion refers to the 


ability of defining an object in terms of itself. Procedures and functions in Pascal 
can be recursive—they can call themselves, For example, in the previous function 
Sum, if Sum had been substituted for TempSum, than the function would have 
called itself each time a statement involving Sum was executed. 

Recursive functions can be written to do exponentiation (for example, x"), 
factorial, and Fibonacci number generation.* Such functions may be appropriate to 
illustrate how recursion works, but in each case a more efficient non-recursive 
function can and should be used. There are examples, such as tree traversals, 


3For a discussion of recursion-and iterative methods of generating factorials and Fibonacci 
numbers see Appendix C, z 


pa 
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variable parameter, the actual parameter establishes a memory location and the 
formal parameter references the same location. Both a value and an address can be 
passed. Thus a change in a value of a formal parameter also changes the actual 


' parameter value. For each value parameter in the procedure heading, a new mem- 


ory location is maintained, different from that of the actual parameter. When the 
procedure is called, a copy of the value of the actual parameter is stored in the 
address of the formal parameter. Only the value, not the address, is passed. Any 
change in the formal parameter’s value will not change the actual parameter’s 
value. 

If communication between a procedure and any other part of the program is 
maintained through parameters, there is less chance of hard-to-detect errors called 
side effects. These occur when changes in values in one program module affect 
another program module without the values being passed as parameters. A global 
parameter directly referenced (not through a parameter list) within a procedure, 
although legal, can cause undesirable side effects elsewhere in the program. 


EXAMPLE 


program TwiceSum (input, output); 

var n, Total: integer; ; 

procedure AddUp |k: integer; var Sum: integer); 
var i: integer; 


begin .. 
k <a Bsk 
Sum := 0; 


for i := 1 to k 
do Sum := Sum + i; 

end; 
begin 
SMS 

AddUp |n, Total); 

writeln (Total) 
end. 


The procedure AddUp is called with n = 5. The value parameter k also 
becomes 5. The variable i is a local variable defined only in the procedure. In the 
first executable statement of the procedure k gets the value 10 (but n does not 
change). The for statement is executed 10 times and the resulting value of Sum is 
1+2+3...+ 9 + 10 = 55. This value is passed to Total because Sum is a 
variable parameter. The output of the program is 55, the value of Total. After the 
procedure terminates, n = 5 but i and_k do not have values. 


12. Functions 


_ Pascal implements both built-in functions and user-defined functions. A user- 
defined function returns a value to the calling program. The function’s name is the 
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Towers of Hanoi problem, and quicksort method for which recursion provides a 
more natural and easier to comprehend algorithm. 


14. Textfiles 


A textfile is a file of single character components. It is read and written one 
character at a time. Other kinds of files exist in Pascal and will be discussed later. 
A textfile must be typed as a file of character, or, more simply, text. For example: 


var Filel, File2; text; 


specifies Filel and File2 as textfiles. 
The required procedures read, readIn, write, and writeln may be used with 


textfiles. For example: 


read (File1, ch); 
write|File2, ch); 


will read the next character from Filel and write it to File2. In order for this to 
happen, Filel must be opened for reading and File2 must be opened for writing. 
The standard procedures for doing this are: 


reset (File1); 
rewrite|File2); i , 


reset may also be used to return to the beginning of the file so that it may be 
reread. 

There are two standard external text files which allow communication be- 
tween the program and the outside world in Pascal. They are input and output. 
They are always open for reading (input) and writing (output). Also, if no file name 
appears in a read or write, for example, then the files input and output, respec- 
tively, are assumed. 

Textfiles, which are external files (including input and output) must be in- 
cluded within the parentheses in the program header. (NOTE: This is not neces- 
sary in UCSD Pascal or in microcomputers.) With the exception of input and 
output, they must be declared as files of type text in the var declaration section. 
Textfiles which are internal files only (files which exist only during program exe- 


cution and which cannot communicate or be stored outside of the program) must 
also be declared but are not included in the program header. 


- Textfiles (which are input files) contain an end-of-file marker (eof) and end-of-. 


line markers (eoln). When a file is opened for reading, eof and eoln are set to false. 
eoln becomes true if the next character to be read is the <eoln>. eof becomes true 
when no more data is in the file; thus a loop beginning with while not eof could be 
used to read every character in a file, one at a time. . 
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Set #2: Features of the 
Programming Language (I) 





Directions: For each of the following decide which is the best choice. 


Questions 1-3 are based on the following program structure: 


program A; 


| procedure G; | ; 


procedure D; 


begin 
B; 


E; 


end; 





1. A variable declared in procedure B and only in procedure B is accessible in 
(A) procedure B only 
(B) procedure’B, C © 
(C) procedure B, C, E 
(D) procedure B, C, D 
(E) procedure B, C, D, E 
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2. A variable declared in procedure D and procedure D only is accessible in 

{A} procedure D only 

(B) procedure B, C, D 

(C} procedure B, C, D, E 

(D) procedure B, C 

(E) procedure D, E 


3. Which describes the order in which the procedures in program A are called? 
(A) D, C, D 
(B) D'BE: D 
(C). D, B, E BD 
(D) B, C, D, E, B 
(E] D EGD- 


CBE 
C, E ; 
C.D. E, j 


~ 


~ 


4. -Which of the following expressions are equivalent? 


I. not (X and Y) 

II. not X or not Y 
II. X and not Y 
IV. not X and not Y 


(A) I&I 

(B) 11 & Il 

(C) I&A II 

(D) I & IV . 
(E) I & IV 


5. The Boolean expression not A and B or C is equivalent to 
(A) ((not A) and-B) or C 
(B) not A and (B or C) 
(C) (not A) and (B or C) 
(D) not (A and B) or C 
(E) not (A and B or C) 


6. Which of the following evaluates to 1? 
(A} 6 + 4 div 10 


(B) 2 — 9 div 5 

(C) 8 — 3 mod 2 

(D) (5 — 2) * (2 — 1) mod 3 
(E) —15 div 2 


7. Which of the following is equivalent to trunc { — 2.6} 
(A) trunc {.2E — 02) 
(B) round | — 1.2) oo $ 
(C) round | — 1.5) 
(D} trunc | — 3.9) 
(E) round | — 2.7) 


E 
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12. Which of the following is an illegal statement? 
(A) Age := pred(|Teens); 
(B) Teens := Teens + 1; 
(C) Tone := succ(Blue); 
(D) Age := Age mod 2; 
(E) Birthday := Birthday + 1; 


13. Which of the following is an illegal statement? 
- (A) Flag := 7 = 5; : 
(B) Birthday := Age + pred|Teens); 
(C) Age := Answer; 
(D) Answer := ord|Red) mod 5; 
(E) Result := ord(Green) div 2; 


14. Which of the following is an illegal statement? 
(A) Result := pred|Answer); 
- (B) Result := Birthday + 10; 
(C) Teens := pred(Birthday); 
(D) Result := ord|Green); 
(E) Hue := succ|Blue); 


15. Which of the following is an illegal statement? 
(A) Tone := Orange; 
(B) Hue := ord(Blue); 
(C) Flag := odd(Birthday); 
(D) Result := trunc|Answer); 
(E) Answer := ord(|Blue); 


Use the following program fragment to answer questions 16-20. 


var 
Choice : char; 
XYZ : real; 


ILJ,K, : integer; 


Given the following input from the keyboard: 
2 13 . 
689 


21 


NAWMN 


16. What is the result of: readin (Choice); 
writeln (Choice); 

(A) 12 
(B) 6 
(C) 5 

(D) 1 

{E) 7 


63 





64 . Multiple Choice Questions 


17. What is the result of: readin (X,Y,1,],K.); 
writeln {1,J,K) 
(A) 12 13 6 
(B) 13 6 
(C}-12.6%5 
iD}: 5.3 2... 
(E) 689 


18. What is the result of: for I := 1 to 5 do 
read (K); 
writeln {K}; 
(A) 12 13 689 
(B) 4 - 
(C) 9 
(D) 3 
(E) 6 


(| 
[l 
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19. What is the result of: for /:= 1.to5 do 
read (Choice); i] 
writeln (Choice); 
(A)9 ; 
(B) 3 i i] 
(C) 5. eS 
(D) 4 Ba . i 
{6 | a 
1 
A 
l 
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i 
(i 
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20. What is the result of: for I := 1 to 5 do 
readIn (Choice); 
writeln (Choice); 


(A) 9. 
(B) 3 
(C) 5 
(D) 4 
(E) 6 
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Directions: For each of the following choose the best answer. 


F 


A procedure with the following heading 


procedure Compute (var x : integer; y : integer); 


cannot be called by 

(A) Compute (Time, Distance; 

(B) Compute (Time * 25, Distance); 
(C) Compute (Time, Distance + 25); 
(D) Compute (Time, 205); 


‘(E) all of the above 


A function 


I. can stand alone as a statement 
Il. can be used as a variable in an expression 


': I. can bea replacement for an expression 


(A) I 

(B) I 

(C) Il 

(D) I&II 
(E) WS Ul 


A procedure may be 


I. referenced in an expression 
II. used alone as a statement 
III. assigned a value 


(A) I only 
(B) II only 
(C) HI only 
(D) I&I 
(E) I&AM 


4. The value a user defined function returns may be 


I. real : 
II. integer subrange 
Ill. character 
IV. boolean 
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(A) I only 

(B) 1, M & IV 
ACEA 

(D) H, HI & IV 

(E) I, I, Wl, & IV 


Answer questions 5-7 given the following program segment. 


for Index := 2 to N do 
if A[Index] < Max 
then writeln ("FOUND’); 


5. What is the minimum number of times that “FOUND” is printed? 
(A) N 
(B) O 
(C) 1 
(D) 2 
(E) N-1 


6. What is the maximum number of times that “FOUND” is printed? 
(A) N 
(B) 0 
(C) 1 
(D) N-2 
[E] N-1 


7. Which of the following could not be added to the action of the “then”? 
(A) Index := Index + 1; 
(B) writeln (A [Index]); 
(C) N := Index; 
(D) A[Index] := 0; 
(E) Max := 100; 


Use the following program segment to answer questions 8-11. 


program TwiceSum; . 
var n, Total : integer; 
procedure AddUp (k integer; var Sum : integer); — 


var i: integer z 
begin : 
k = 2°*k; 
Sum: = ay 
fori:= ltok roe 


Sek := Sum + i 


end; 
begin (*MAIN*) 

n= 2; 

AddUp |n, Total); 
writeln (Total) 
end. . 





10. 


11. 


12. 
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The program segment will writeln 


. (A) 4 


(B) 10 


(Sg 


(D) 0 7 
(E) 3 


When the procedure terminates, the value of n is 

(A) 4 

(B) 2; ' 
(C) 10 

(D) 1 

(E) 0 


The value of k after the procedure ends is 
(A) 4 

(B) 2 

(C) 10 

(D) 1 

(E) no value 


The value of i after the procedure ends is 


(A) 4 
(B) 2 
(C) 10 
DEA 
(E) no value 
The loop 
repeat 
Sum := Sum + i; 
= 1+ 1; 


until (i > 100); 
writeln (Sum); 


I. never terminates 
Il. must execute at least once 
Ill. terminates when i > 100 is true 


(A) I only 

(B) II only 

(C) HI only 

(D) H & Ill 

(E) none of the above 


Which of the following program segments will calculate the sum of integers 
from 1 to 100? 
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70 Multiple Choice Questions : 


I. begin 
Sum := 0; 
for i := 1 to 100 do 
Sum := Sum + i; 


end; 
Il. begin 
Sum :=0; |. 
for i := 100 downto 1 do ; 
Sum := Sum + i; 
end; 
Ill. begin -. 
Sum := 0; 


if i <= 100 then © 
Sum := Sum + i; 
end; 


(A) Ionly 
(B) II only 
(C) HI only 
(DJ I&II 
(E) HAHI 


l4. Given the following program segment, how many times will the function 
Fact be invoked, given a main block call: writeln (Fact (4);? 


function Fact (n : integer) : integer; 


begin 
if (n = 0) then 
Fact := 1 

else 


Fact := n *Fact (n — 1); 
end; (*Fact*) 


(A) 5 times 
(B) 4 times 
(C) 3 times 
(D) twice 
(E)- once 


15. Which is (are) true when a textfile is opened for reading? 
I eof is set to false 
Il. eoln is set to false 
Ill. eof is undefined 


(A) I only : : 
(B) II only : 

(C) HI only 

(D) I& HI 

(E) none of the above 


oh 
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16. 


17. 


18. 


19. 


Set #3: Features of the Programming Language (I1) 


Use the following function to answer questions 16-18. 


function Power (x : real; y : integer) : real; 
var 
z : integer; 
begin 
Product := 1; 
for z := 1 to y do 
Product := Product * x; 
Power := Product; 
end; 


The base of the power function calculated in function Power is 
{A} 1 z i s 


-(B) x 


(C) y 
(D) z 
(E) none of the above 


The exponent in function Power is 


(A) x 
(B) y 
ie Be 


4 


` (D) Power 


(E) none of the above 


A correct invocation of Power is 


IL writeln (Power(3,4)); 

I. write (Power (Power(3,2),4)); 
HI. Sum':= Power|Base,n) + Power|2,3); 
IV. Sum := Power|2,3.2) + Power(3.2,2); 


(A) I only 

(B) I&II 

(C) I&II 

(D) 1, 1 & H 

(E) all of the above 


The following loop will print 


for i := 1 to 100 do 
begin 5 
Count := Count + 1; 
i:= it 10; 
write (Count) 
end; 


(A) integers from 1 to 100 

(B) integers from 1 to 10 ` 
(C} integers from 1 to 100 in multiples of 10 
(D) integers from 1 to 1000 in multiples of 10 
(E) none of the above 
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Multiple Choice Questions 


20. To count the number of single males over 21 and all married males we can 


use the loop 


I. if (Sex = ‘male’) and (Married = ‘y’) or (Age > 21) then Count := Count + 1; 


Il. if ((Sex = ‘male’) and (Married = ‘y’)) or 


Ill. if (Sex = ‘male’) and ((Married = ‘y’) or 


(A) I only 
(B) H only 
(C) HI only 
(D) I&II 
(E) I&AM 





(Age > 21) then Count : 


[Age > 21)) then Count : 


Count + 1; 


Count rth 


Ñ 
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i Chapter 8 


lEnumerated Data Types, Subrange 
gDaia Types, and Arrays 


§ murtre CHOICE 


æ 8-1. 
i 
[l 


8-2. 


Which of the following statements about data types is true? 

a. All scalar data types are also ordinal data types. 

b. All ordinal data types are also scalar data types. 

c. The data type real is both a scalar and ordinal type. 

d. Avscalar data type has a unique predecessor and a unique successor. 


All of the following are examples of scalar data types, EXCEPT à 
boolean 


a. real E 

b. integer d. string 

In Pascal, the user can define a new ordinal data type by listing all the possible values for 
that type. This user-defined data type is known as a(n) data type. 

a. enumerated ; c. subrange . 

b. structured d. scalar 
A data type that contains a portion of a predefined ordinal or enumerated data type is 
known as a(n) data type. 

a. enumerated c. subrange 

b. struchmed —— d. scalar 

Which of the following is a valid enumerated data type definition? 

a. type 

cars : Ford, GM, Chevrolet; 
b. ty 


pe : 
vowels = ( a, e, i, 0, u); 

Cc. var 

bus : vehicle; 


type 
small number:= 1, 2, 3, 4; 
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8-6. Which of the following statements about enumerated data types is true? 
a. Anenumerated data type can only be defined in-a iype definition. 
b. The constants that make up an enumerated data type need not be ordinal. 
c. Enumerated data types can be directly read or written using ‘read’ and ‘write’ 
statements. 
d. The values used in enumerated data types must be unique; they may not be members 
of any existing data types. : 


Use the following program segment to answer questions 7 and 8: 
pe 
fruits = ( Apple, Orange, Grape, Pear, Lemon); 


8-7. What is the ordinal number of Lemon? 
aS G3 
b. 4 d. 6 


8-8. Based on the program segment above, which of the following expressions is valid and 
evaluate as true? ; > 
a. succ ( Apple) = Orange c. pred ( pred ( Pear)) = Grape 
b. Ord (Orange) + ord ( Apple) = 3 d. Orange >= Apple 


8-9. Which of the following statements about subrange data types is true? 
a. The base type of a subrange can be any scalar data type. 
b. Subranges list all the possible values in a type. — 
c. A subrange may have only standard data types as a base type. 
d. Subranges can be defined under type definitions. 


8-10. The values of the subrange type are 
a. all the values of the base type 
3 b. ali the-values of the base type that fall between two specified constants 
-._¢. either real numbers or integers 
d. of the type char 





8-11. Which of the following subrange declarations is valid? 
a. type 
Grade = 'A’...'F'; 


var 
Result ; Grade; 
b. type 
SmallNumber = 1..5; 
var 
LittleNumber : SmallNumber; 
c. type 
Days = (Monday, Tuesday, Wednesday, Thursday, Friday, Saturday, Sunday); 
Weekdays = Monday..Friday; 
var 
Workdays : Weekdays; 
d. 


pe 
Months = January..December; 
var 
YearDivision : Months; 





ee a 
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Use the following program segment to answer questions 12 and 13: 


program Sample (input, output); 


type 
SmallInteger = -10..10; 


var 
Big : integer; 
Little : SmallInteger; 
a, b : real; 
8-12. The type SmallInteger is a(n) data type. 
a. enumerated c. structured 
b. subrange d. standard 
8-13. Based w the program segment above, which of the following statements would be 
invalid? 
a. Big := Little; c. Little :=-11; 
b. readin (Big); d. Little := -9; 
Little := Big; 


=e 2 oe ee oe ee ee 


8-14, Is the following subrange definition valid? If not, why? 


type 
Number = -5.5..20; 
‘a. Itis valid. 
` It is invalid; the minimum value must be a positive value. 
. It is invalid; the minimum value is not an ordinal data type. 
. Itis invalid; the equal sign (=) should be replaced with a colon (:). 


am. 
AoE 


8-15. Variables of simple data types such as integer, real, byte, char, and boolean contain 


se single entity each 
. a collection of components grouped together 
. multiple values 


8 
Ao» 


. no values — 
! 8-16. A collection of components can be grouped together and referenced under a single 
| variable name using ; 
i a. simple data types c. subrange data types 
i . b. enumerated data types d. arrays 
8-17. Allof the following are structured data types, EXCEPT ; 
a. records c. sets 
b. folders d. arrays 


8-18. Whatis an array? 
a. Itis an ordered collection of elements, having a common variable name, and all of 
the same data type. 
b. Itis acollection of elements of different data types which are stored in adjacent 
memory locations. 
c. Itis a variable which contains multiple values of the same data type. 
d. Itis a location in memory which holds multiple values of the same data type. 
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8-19. var 
Score.: array [1..10] of real; ; 
Based on the declaration above, which of the following statements is true? 
a. Arrays may only be declared under variable declarations. 
b. The array, Score, has ten array elemenis of type integer. 
c. Each item in the array will be of the data type real. 


d. The compiler wiil associate nine memory locations with the name ‘Score’. 
8-20. Which of the following array declarations is valid? 
a. type 
Numbers = array [real] of real; 


pe 
SmallArray = array [1..3] of integer; 


var 

Score : SmallArray; 
c. var 

Students = array [1..100] of integer; 
d. type 


RealValues = array [1..25] of real; 
var 
RealValues : Score; 


8-21. Individual array elements in an array are referenced by using an array 








‘a. subscript c. variable ee 
b. sublist d. script 
8-22. The subscript identifies the of a particular array element in the array. 
a. location . c. range 
= b., value d. name 
8-23. The byte data type allows for subscripts in the range of ; 
a. Othrough25 - c. O through 255 
b. -25 through25 — : d. -255 through 255 


type ; 
Results = array [1..25] of real; 
var 
Score : Results; 


Score [25] := 20; 





The statement Score [25] := 20 means that the value has been assigned to the 
element of array ; ; : 

a. 25, 20th, Score c. 20, 25th, Results 

b. 25, 20th, Results d. 20, 25th, Score 


8-25. What array element does Score [x * 2 + 2] refer to? Assume that X is equal to 2. 
a. 6 G. 2 
b. 8 d. 4 


$ 
fi 
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f 
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Use the following program segment to answer questions 26-28. 


pe 

Numbers = array [1..10] of integer; 
var 

Sum : Numbers; 

x : integer; : 


begin 


San F <= 99.5 
writeln (Sum [2 * x + 4D; 


end. 


8-26. Sum [2 * x + 4] refers to array element F 
a. 16 ae 
b. 14 d; 10 





8-27. = sing the printed output of the writeln statement? 
a. 99. 


bo 
co. 3 
d. The writeln statement would generate an error message. 


8-28. If the writeln statement was changed to writeln (Sum [2 * x]); what would the printer 


output be? . 
a. 99.5 

b. 8 

c. 99 

d. The writeln statement would generate an error message. 
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8-29. The is the data which is stored at the memory location referenced by the array ` 





name and subscript. 
a. content of the array element c. array subscript 
b. value of the subscript d. subscript data type 


8-30. Evaluate the following statement, assuming the content of Sum [5] is 10, the content of 


Sum [3] is 5, and the content of Sum [4] is 3. 
Sum [5] := Sum [3] + Sum [4]; 
Which of the following statements is true? 
The value of the Sum [3] + Sum [4] is 10. 
The values stored in Sum [3] and Sum [4] remain unchanged. 
The value in Sum [5] remains unchanged. 
The value in Sum [5] is replaced by the value of Sum [7]. 


rete oe 
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8-3 1. To fill an array with data or print the values stored in an array, are used. 
a. files c. sets 
b. records d. loops 


8-32. Which of the following for loops would read in test results of fifty students and store 
them in an array called ‘Score’? 
a. fori:= 1 to50do c. fori:=1to50do 
read (Score[i]); read (Score); 
b. for Score := 1 to50 do d. for Score := 1 to 50 do 
read (Score[1..50]); read (Score); 
Score := Score + 1; 


8-33. Which of the following while/do loops would perform the same task as in question 32? 
* Assume that the following constant and variable declarations apply to all the loops: 
const 
MaxNumber = 50; 
var 
Score = array [1..50] of real; 
Count : integer; 


a. While (Count <= MaxNumber) do c. Count :=0; 
begin {while} While (Count < MaxNumber) do 
Count := Count + 1; begin {while}. 
readIn (Score[Count]); Count := Count + 1; ~ | 
end; {while} readin (Score[Count]); - 
end; {while} 
b. Count := 0; d. Count := 0; 
While ( Count <= MaxNumber) do While (Count <= MaxNumber) do 
begin {while} begin {while} 
Ceunt:=Count+1; ` readin (Score[Count]); 
a readin (Score[Count]); Count := Count + 1; 
end; {while} ~ end; {while} 


subprograms? 

a. The values of the elements of the array will not change. 

b. A complete copy of all the array elements will be made for the subprogram. 
c. This saves memory space. 

d. There will be fewer errors in the program. 


8-35. When character arrays are used, the compiler automatically converts them into ; 
a. words c. elements 
b. strings d. variables 


8-34. What is the advantage of using variable parameters when passing arrays to ij 
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- 8-39. 


8-36. 


8-37. 


8-38. 


8-40. 


8-41. 


8-42. 
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What does the following array declaration imply? 


Names = array [1..15] of char; 
var 

Name : Names; 
‘Name’ is an array containing : 
a. fifteen elements, each of type char 
b. fifteen names in fifteen adjacent memory spaces 
c. fifteen names, each consisting of fifteen characters 
d. characters numbering anywhere from one to fifteen 


Which of the following array constant definitions is valid? 
a. const . 
Numbers : array [1..20] of real; 
b. const 
Vowels = array ['A'..'U'] of char; 
c. const ; 
ShoeSizes = array [1..5] of integer : (3, 5, 7, 9, 11); 
d. const 
Weights : array [1..5] of real = (110.5, 101.4, 90.0, 145.5, 120.2); 


In Pascal, the number of dimensions in an array indicates the number of 
a. elements in an array 


- b. subscripts needed to access a single element within that array 


c. data types in an array 
d. memory locations used in an array 


An array with more than one index is called a array. 
-a. multi-subscript c. multi-dimensional 
b. . poly-index d. multi-index 
In a two-dimensional array, two are needed. 
a. elements c. values 
b. memory locations d. subscripts 


Which of the following will have the value of the element in the sixth column, second 


row of the two-dimensional array A? 
a. A[6,2] c. A[2,6] 


`b. A[6,6] d. A[2,2] 


Which of the following arrays can contain the greatest number of elements? 
a. Matrix = array [1..10, 1..5] of real; 

b. Count = array [1..20, 'A'..'C'] of integer; 

c. Position = array [{ ‘a’..'z'] of integer; 

d. Sum = array [-10..10, 1..5] of real; 
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8-43. Which of the following is the best declaration for a three-dimensional array called Book 
of type char that contains 100 characters per line in a 200-page book with 50 lines per 


page? ok 
a. type i 
Book = array [1..200, 1..50, 1..100] of char; 


b. type 
Book = array [0..200, 0..50, 0..100] of char; 


c. type 
Book = array [1..100, 1..200, 1..50} of char; 


d. type 
Book = array [1..109, 1..50, 1..200] of char; 


8-44, How would you read in values into each cell of the array Book in question 43 using a 


for loop? 
Assume the following type and var declarations apply to all the for loops: 


type 
pages = 1..200; 
line = 1..50; 
characters = 1..100; 
Booktype = array [pages, lines, characters] of char; 


var RA f 
page, line, character: integer; ; ee 
Book : Booktype; 


for character := 1 to 100 do 
for line := 1 to 50 do 
readin ( Book({page,line,character]); 
> b. for character ‘= 1 to 100 do 
; for line :=.1 to 50 do 
for page := 1 to 200 do 
readin ( Book(page,line,character]); 
c. for page := 1 to 200 do 
for line := 1 to 100 do 
for character := 1 to 50 do 
readin ( Book[page,line,character]); 
d. for page := 1 to 200 do 
for line := 1 to 50 do 
for character :='1 to 100 do 
readin ( Book[page,line,character]); 


a. for page := 1 to 200 do ; i 
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Records and Sets 


MULTIPLE CHOICE 


11-1. What is a record? 

a. Itis acollection of identically typed objects. 

b. Itis a structured data type which consists of a sequence of components, all of the 
same data type. 

c. Itis astructured data type that allows a group of related items, not necessarily of the 
same data type, to be referenced by a single name. 

d. Itis a structured data type that allows a group of items of the same data type to be 
referenced by a single variable. ` 


11-2. In which of the following cases is it most suitable to use a record? 
a. Alphabetizing a list of names and storing it so that the list may be used as input to 
different programs. 
b. Reading a list of characters and then manipulating them. 
c. Reading a list of exam scores and then determining the highest, lowest, and average 
scores. - ; - i 
` d. ‘Keeping a student file which includes the students’ name, I.D. number, sex, and 
` college standing. 


11-3, What is a record field? 
a. Itis adata item which is a part of a record, and it must be assigned a specific data 


type. 
b. It specifies the number of memory locations which are allocated toa record. 
c. It specifies the maximum number of characters allowed for each element of a record. 
d. It specifies the number of data items allowed in a record. 


Use the following record declaration to answer questions 4-6: 


Automobile = record; 
Model : string[15]; 
Year : integer; 
Color : string[10]; 
Price : real; 

end; 
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11-4. 


11-5. 


11-10. Which of the following program segments using the with statement is valid? 
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Is the record declaration above valid? If no, why? 

a. Itis valid. = 

b. Itis invalid; ‘Automobile = record;’ should not contain a semicolon (;). 

c. Itis invalid; the record type definition should not conclude with an ‘end’. 

d. Itis invalid; there should be a ‘begin’ after the first line of the record definition. 





‘Model’, ‘Year’, ‘Color’, and ‘Price’ are of a record. 
a. variables c. data types 
b. fields d. values 


Assume the variable Car is declared to be of type Automobile. How would you assign a 
value to data item ‘Color’? 


a. Car.Color := 'red'; c. Automobile.Car.Color = ‘black’; 
b. Color.Car := blue; ; d. Car.Color ='white'; 

Individual record fields are referenced by using a : 

a. subscript c. field selector 

b. index d. field index 


Which of the following comparisons of arrays and records is false? 

a. They both provide a way to give a single name toa list of values. 

b. Subscripts can be used to reference the individual elements. 

c. Elements in an array list must be of the same type; The component values of 2 record 
- may be of different types. ate ae 

d. Records are kept in secondary storage whereas arrays are not. 


Why is the with statement useful? 

a. It eliminates the need for referring to each field by both the recordname and 
fieldname; therefore, it is a simpler method of referencing to fields within a record. 

b. It allows the user to reference several fields simultaneously. 

c. It copies data stored in one field to another. 

d. It checks the contents of each field. 


` 


a. with Car do c. with Car do 
begin i begin. 
Car.Model : 'Ford'; Model := 'Chevrolet'; 
Car. Year : 1987; Year := 1987; 
Car.Color : ‘red’; Color := ‘blue’; 
end; end; 
b. with Car do d. ‘Car with Automobile do 
Model := 'GM'; begin 
Year : = 1986; : Model.Car :"Volkswagon’; 
Color := ‘black’; Year.Car : 1985; 
end; Color.Car : ‘green’; 
end; 








11-11. 


11-12. 


11-13. 
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How are records copied? 

a. Each field is copied, one at a time. 

b. All the fields of one record may be copied to another using a single assignment 
statement. 

c. There is a pre-defined copy function which will do the job. 

d. Each element of a field is copied, one at a time. 


A general rule when copying records is that both records must i 
a. be of the same record type c. contain the same number of elements 
b. beof the same data type - d. contain the same number of fields 


Assume there is an array of records for students and that each record contains the 
student's name, sex, social security number and college standing. How would you 
reference the name of the fourth student, assuming the following type declaration has 
been made? 


Students = record 
Name : string[30]; 
Sex : char; 

SSN : string[11]; 
Standing : string[9]; 


end; 
Student :array[1..10] of Students; 


var 
Undergrad : Student; eae 
a. Student[4].Name c. Undergrad.Name[4] 
` b. Undergrad(4].Name d. Student.Name[4] 


11-14. 


11-15. 


Referring to the declaration in question 13, how would you assign a value to the 


_ Standing field of the tenth record in the array Undergrad? 


a. . Undergrad.Standing[10] : Freshman’; 
b. Undergrad[10}-Standing := ‘Junior’; 

c. Undergrad[10}.Standing = ‘Sophomore’; 
d. Standing.Undergrad[10] : ‘Senior; 


Consider the following type declarations: 
Exercise = record 
Y1 : integer; 
Y2: char; 
end; 


var 
X1, X2 : Exercise; 
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11-25. Which of the following type declarations for a set is invalid? 


a. type | 
Fruit = (Apple, Orange, Grape, Pear, Banana); 
Fruits = set of Fruit; 


= : 
Score = set of 1..100; 
pe ] 
Numbers = set of integer; 
type 
SmallNumber = set of 0..5; 


11-26. A set which is assigned to the set means that it contains no elements. 


a. universal c. sera 

b. empty d. negative 
11-27. Allof the following are set operators, except ___ ‘ 

a. non-intersection (/) c. difference (-) 

b. union (+) d. intersection (*) 


11-28. What is the rule when two sets are being manipulated by set operators? 
a. The operands must be sets of the same type. 
b. The operands must contain the same number of elements. 
c. The elements in the sets can only be integers. ae 


d. The elements in the sets must be listed in their ordinal sequence. 
Determine the value retumed by each of the following expressions: 


11-29. [7, 8, 9] + [8, 1, 5] 


a. gE a a, 8, 8, 9] - = C. H; 5, T 9] 
b- 1159; Bee d. [8] l 
11-30. pas H 'd'] x ['e’, 'd', 'c'] 
a. [‘a', TR fas 4 'e'] c. ['d'] 
b. ‘a’, +) fa 'e'] d. [] 
11-31. [2, 4, 6, 8] - [8, 6, 4, 2] 
a. [] c. [2] 
b. (2, 4, 6, 8] d. (2, 2, 4, 4, 6, 6, 8, 8] 
11-32. Which of the following expressions evaluates as true? 
a. (a, ‘c', ‘d'} <= Ta, cl CG: <= [1, 2] 
b. [1, 2, 3, 4] >= [5, 4, 3, 2, 1] d. (7, 8,9] <> [8, 7, 9] 


11-33. Which of the following expressions evaluates as false? 
á [2 < 29. A 
b. (2; 4] 2% (2, 4, 5] 
C. [‘a’, hes C, 'd'] <= e, ‘a’, Ca bh ‘a'] 
d. {1,2} >= §) 
11-34. What set operator is used in the set membership test? 


a. + G <> 
bos d. in 
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The following type definition and variable declaration is an example of a record within a record: 


| pe 
StudentRec = record 
Name: string[30]; 
a SSN_: string[9]; 
Age: integer; 
end; 
Student = record 
| Address : string[40]; 
Person : StudentRec; 
Phone: integer; 
| end; 
var 
Undergraduate : Student; 


| Use the program section above to answer questions 19 and 20. 


| 11-19. How would you reference the undergraduate's address? 


a. Undergraduate.Person.Address c. Undergraduate.Address 
b. Undergraduate.Student.Address d. Student.Address 
D 11-20. How would you reference the undergraduate's age? 
a. Undergraduate.Student.Age. c. Undergraduate.StudentRec.Age 
b. Undergraduate.Person.Age Ci Dadepgradiage Age 


| 11-21. In Pascal, asetisa 
; a. collection of identically ntically typed objects which must have the same ordinal base type 
b. structured data type which consists of a sequence of components, all of the same data 


type . 
| c. structured data type that allows a group of related won not necessarily of the same 
data type, to be referenced by a single name 
d; structured data type that allows a group of items of the same data type to be 
i] referenced by a single variable 


11-22. The maximum number of elements in a set is ; 
| a. 200 eek c. 256 
b. 250 d. infinite 
11-23. Which of the following statements about Pascal sets is false? 
| a. A set exists as an indivisible unit. 
b. There is a limit to the number of elements in a set. 
c. The elements of a set need not necessarily be of the same type. 
| d. Individual set elements cannot be accessed and manipulated. 


11-24. The indicates what ei or range of values mias belong to a given set. 
a. ordinal type : c. scalar type 
b. base type d. array 
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11-35. The is a subset of every set. : 
a. universal set c. set of integers 
b. -empty set 3 d. set of char 


11-36. Itis more suitable to use the array rather than a set when i 
a. the order of the elements is important 
b. you need to manipulate the elements 
c. you want to be able to delete and add elements easily 
d. the need arises to use an entire array in a Boolean expression 


Use the following program segment to answer questions 37-39: 
Assume the input data is a series of integers separated by blanks like this: 1p1p0%1%0p1%1 


type 
DaysofWeek = (Monday, Tuesday, Wednesday, Thursday, Friday, Saturday, Sunday); 
Days = set of DaysofWeek; 
procedure ReadSet (var A : Days); j 
(* This procedure reads set values to a variable of type Days *) 
var 
B : Daysof Week; 
Code : integer; 
begin (* ReadSet *) 


A:=[]; 
for B := Monday to Sunday do 
begin 
read (Code); 
if Code = 1 
then A := A + [B]; 
end; (* for *) 


~ readin; 
_ .end; (* ReadSet *) _ k; 
H i 


11-37.. What is the value of A? : 

. [Monday, Tuesday, Wednesday, Thursday, Friday, Saturday, Sunday] 
. [Monday, Tuesday, Thursday, Saturday, Sunday] 

. [Wednesday, Friday] 

d. (Monday, Tuesday, Wednesday, Friday, Saturday, Sunday] 


ans 


11-38. How would you delete element ‘Monday’ from set A? 
a. A:=A- ‘Monday’; c. A=A- Monday; 
b. A:= A - [Monday]; d. A=A- [Monday]; 


11-39. Assume the if statement has been changed to: 
if Code = 0 
then A := A + [B]; 
What would the value of A be? 
.a. [Tuesday, Thursday] 
b. (Monday, Tuesday, Thursday, Saturday, Sunday] 
c. [Wednesday, Friday] 
d. [Monday, Tuesday, Wednesday, Thursday, Friday, Saturday, Sunday] 
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In a stack, items are added and removed only from the top. For instance, it 
would be possible to remove the 18 from the stack, or to place another number on ( 
top of the 18. However, the 6 cannot be accessed until the 18 is removed, and the 7 : 


cannot be accessed until the 6 is removed.. 


When a recursive call terminates, the previous local values are restored by 
removing them from the stack. By using a stack, the computer can call a recursive 


procedure over and over, and, as long as there is room on the stack, still restore 


any local values when the call terminates. 


The following is an outline of what occurs when Example is called. 


So long as X is greater than 0, the steps taken by Example are: 


1. Write the current value of X on the screen using the first writeln. 

2. Save the current value of X on the stack. 

3. Call Example recursively with the value X-l. 
When the call Example (2) is made, these steps are repeated twice: once for xX-=2 
and once for X = |. 


When X is equal to 0, the steps taken by Example are: 
1. Write the current value of X, 0, on the screen using the first writeln. 
(X is now 0 so no recursive call is made and the procedure skips to the 


second writeln.) 
2. Write the current value of X, 0, on the screen using the second writeln. 


3. Terminate the execution of this call to Example. 


This does not terminate the recursion. The following three steps are now repeated mate | 
I 


for cach value stored on the stack. 


When execution of a recursively called Example is terminated, the steps taken are: 


l. Restore the value of X from the stack. 
2. Write the value of X on the screen using the second writeln. 


3. Terminate the exccution of Example. 


If the original call was Example (2), these steps would be repeated twice, once 
restoring a 1 from the stack and printing it, then restoring and printing a 2. 


Based on this outline, it can be seen that a single complete call to Example 
consists of the execution of the entire procedure. This includes writing the value 
of X on the screen with the first writeln, saving the value on the stack, making a 
recursive call, restoring the value of X from the stack and writing that value on 
the screen with the second writeln. Of course, the recursive call is only made (and 
therefore any values saved or restored) when X > 0. A recursive call is considered 
to be terminated only after it has completed these steps and reached the end 


statement. 


To demonstrate how the stack is used during recursion, the calls produced 
by Example (2) are shown below with the stack: 


Call: Example (2) Z 
Example writes a 2 on the screen using the first writeln statement. 


The value of X, a 2, is saved on the stack. 


Aa 
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Chapter Three: Recursion 


On several occasions during your study of Pascal you have been introduced 
to examples of recursion; i.e. a procedure or function which calls itself. The first 
introduction occurred in Chapter 5 where recursive functions were used to 
calculate numerical values such as powers, factorials, and the roots of functions. 
Several graphics examples were presented in Chapter 10 including DrawTree which 
constructed 2 tree composed of smaller trees acting as branches, and DrawFigure 
which created nested figures. It is now time to take a more formal approach to the 
study of recursion. Because it forms the basis for many advanced algorithms as 
well as advanced methods of organizing data, a solid understanding of recursion is 
necessary to pursue further work in computer science. In our discussion of 
recursion, the term "subprogram" will be used to refer to any procedure or 
function. 
nl 
A Recursive Example : 


Consider the following simple recursive procedure: 


procedure Example (X :integer); 


begin 
writeln (’First writeln. X = ’, X); 
E S0 
then Example (X-1); 
writeln (‘Second writeln. X = ’, X) 
~ end; (* Example *) 


The output produced by the call Example (2) is: 
Running 


First writeln. X = 2 z 
First writeln. X= 1 
First writeln. X= 0 
Second writeln. X = 0 
Second writeln. X = 1 
Second writeln. X = 2 


Two common misconceptions are to believe that only the first half is printed (i.e. 
2 1 0), or that the second half consists only of zeros (ic. 2 1 0 0 O 0). To 
understand the output that is actually generated, it is necessary to realize that the 
computer saves the current values of its parameters and local variables cach time it 
makes a recursive call. 


—————————— an 


(ike Sie” 2 ee 
Saving Local Values during Recursive Calls 


Before a recursive procedure or function call is performed, the computer 
first saves the valués of any local variables and parameters in a special place in its 
memory called the "stack". A stack is a data structure which can be used to store 
information and is usually represented as a vertical set of boxes, one on top of 
another. For example, 
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Review 


1. What output would be gencrated by each of the following calls to the recursive 
version of Example? Draw a stack diagram which represents each call. 


a) Example (0); 


b) Example (1); 
c) Example (5)? 


2. What output would be generated by the following procedure 
procedure Review (N :integer); 


begin 
if N> O0 
then Review (N div 2); 
writeln (N) 
end; (* Review *) 
if the call Review (9) was made? Show the contents of the stack at each 
recursive call. 


A Useful Diagram 


Another way to diagram recursive procedures and functions is to draw a 
vertical series of boxes, cach of which represents the current values of the local 
variables and parameters, with down arrows indicating recursive calls and up- 
arrows representing the returns from these calls. For example, the call Example (2) 
would be shown as: 


(Call from program) aoe (Return to main program) 





The values written next to the down-arrows indicate the value of the 
parameter X used in each call, and the values written inside the box represent the 
local values for each call. (If this were a diagram of a function rather than a 
procedure, a value would be written next to each up-arrow showing the value 
returned by the function at that time.) Each separate box is referred to as a "level" 
of recursion. This diagram is useful because it shows all of the recursive calls, 
local values and parameters. 
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` The stack at this point has only one value: 


Call: Example (1) 
Example writes a | on the screen using the first writeln statement. 


The value of X, a 1, is saved on the stack. 


The stack now has two values stored on it. Note how the newest value is placed on 
top of the old one: 


za 


Call: Example (0) 
Example writes a 0 on the screen using the first writeln statement. 
Because X is not greater than 0, no recursive call is made. 
Example writes another 0 using the second writeln statement. 
The end statement is reached so the recursive call is terminated. 


Because no recursive call was made, the stack is unchanged: 


Example (0) terminates 
Example restores the value of X, a 1, from the top of the stack. 
Execution continues at the statement directly after the recursive call (in 
this case, at the second writeln). 
Example writes a | (the current value of X) on the screen using the 
second writeln statement. 
The end statement is reached so the recursive call is terminated. 


After X has been restored, in this case 1, it is removed from the stack: 


Example (1) terminates 
Example restores the value of X, a 2, from the top of the stack, and 


writes it on the screen using the second writeln statement. 
The end statement is reached so the original call is terminated. 


Restoring 2 removed it from the stack. The stack now contains no elements and is 
considered to be "empty". There are no more recursive calls so the process stops. 
The values of X that were printed are: 2 1 0 CoS) 2 


RI 
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The fact that the current value of Count is not destroyed when a level of recursion 
terminates can be seen by changing the second writeln statement to: 


writeln (‘Second writeln. X = ’, X, ’ Count = ’, Count) 


Calling Example with this change and X = 2 produces the output: 


First writeln. X = 2 Calls = 1 
First writeln. X = 1 Calls = 2 
First writeln. X = 0 Calls = 3 
Second writeln. X = 0 Count = 3 
Second writeln. X = 1 Count = 3 
Second writeln. X = 2 Count = 3 


and it would be possible to print the final value of Count, 3, in the main program: 


begin 
Callcount := 07 
Example (2, CallCount) ; 
writeln (’Called Example ’, Callcount, ’ times.‘); 


end. 


The following diagram shows ‘how the value of Count is passed back to each 
level so that the final value returned to the main program is 3: 


a (Return to main program) 





Applying Recursion To Solve Problems 


With this understanding of the operation of recursion, we can now consider 
problems which can be solved recursively. Not all problems lend themselves to 
recursive solutions, and one of the skills a programmer must possess is the ability 
to recognize which do and which do not. 


There are two properties that a problem must have for a recursive solution 
to be useful. The first is that the problem must be able to be reduced to one or 
more smaller versions of the same problem. These smaller versions will be solved 
by recursively calling the procedure or function. (The ambiguous term "smaller" 
will be defined below.) To take an example from the recursive Power function in 
Chapter 5, the value of x® can be found by multiplying X by X5. Thus the X° 


x 
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It is important not to confuse the information shown in this diagram with 


the information stored on the stack. The stack is used for the temporary storage of 


data only and contains no information about variable names, returns from calls, 
etc. 





Transferring Values between Levels of Recursion 


As shown in the diagram of Example, when a level of recursion terminates, 
the value of X becomes the value shown in the box (level) above. Thus, any old 
values of local variables or parameters are lost. (This is why the values written by 
the second writeln are 0,1,2.) The only way to share information (values) between 
different levels of recursion is to include a variable parameter which is used to 
store that information. : 


For example, suppose that Example wanted to print the number of times it 
had been called (Count). This value is increased by one for each call. Making the 
following changes to Example 3 

procedure Example (X :integer; var Count :integer); 

begin 

Count := Count + 1; 
writeln (‘First writeln. X = ’',X,” Calls =’, Count); 
ext > : 
then Example (X-1, Count); 
writeln (‘Second writeln. X = ’, X) 
end; (* Example *) . 


produces the output: 
Running 


First writeln. X = 2 Calls = 1 

First writeln. X = 1 Calls = 2 
First writeln. X = 0 Calls = 3 

Second writeln. X = 0 

Second writeln. X = 1 

Second writeln. X = 2 


The diagram below shows how the value of Count is updated with each cali: 


(Call from program) | 
2, 
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à (Return to main program) 










(Call) 





2,3 
' 8 is the final value returned. | ( 


Base = 2 
Exponent = 3 


Base = 2 
Exponent = 2 







Base = 2 
| Exponent = 1 
2,0| ae 


Y 1 


Base = 2 


Exponent = 0 





The numbers next to the up-arrows indicate the value returned by the function. 
Notice that the value of Base is passed unchanged to each recursive call of the 
function, while the value of Exponent is reduced by one in each succeeding call. 
Also notice that each level maintains its own copy of Exponent and Base and that 
these local copies are not affected by the values in other levels. 


The Need to Approach the Simple Case 


It is important to remember the need for each level of recursion to approach 
the simple case (in the above case N = 0). Some recursive algorithms appear 
plausible but in fact never terminate. For example, it is true that 


xN = x (N+1) / x 
and this might lead to the recursive algorithm: 
if N= 0 


then Power := 1 
else Power := Power(X, N+1) / X; 


The problem, of course, is that N increases with each call and therefore the simple 
case of N = 0 is never reached. 


a a eet Dee er nn acne gies ee LS 
Review 


3. A recursive function can be written to return the sum of all elements from 1 to 
N in an array. 
a) Describe how the problem for N can be solved using a smaller version 


of the same problem. 
b) Describe the simple case for this problem. 
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problem has been reduced to the smaller X5 problem. More generally, we can say 
that 


XN axe XO) 


The second property is that there must be a simple case of the problem 
which can be solved directly, without recursion. For example, in our XN problem 
the simple case is the calculation of X°, which is 1. The need for such a case allows 
us to define what is meant by a “smaller” problem: it is a problem which is closer 
to the simple case. 


These properties lead to the general structure of recursive solutions: 


if it is the simple case 
then do simple, non-recursive processing 
else do recursive call(s) and processing 


This structure can be translated into Pascal as: 


if N= 0 
then Power := 1 
else Power := X * Power(X, N-1)i 


In the recursive Power function from Chapter 5 this became: 


function Power (Base :real; Exponent :integer) :real; 
(* Returns Base raised to the Exponent power. Assumes 
that Exponent is non-negative. *) 


. begin 
if Exponent = 0 
then Power := 1 
else Power := Base * Power(Base, Exponent-1) 
end; (* Power *) 


Because the value of the exponent, N, is reduced for each recursive call, it must 
eventually reach a value of 0 (the simple case). Keep in mind, however, that if the 
function is called with a negative value for N, it will never terminate, but instead 
will generate a sequence of recursive calls for more and more negative values of N. 
The computer will continue to generate recursive calls until no memory is left for 
the stack to hold copies of the local values, thereby halting program execution with 
an error message. 


A boxes and arrows diagram will be helpful here as well. A call to Power 
with a Base value of 2 and an Exponent of 3 will look like this: 
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If this code replaces the previous code in the Power function, it correctly 
implements the new algorithm. 


A diagram of the execution of the new Power function is helpful. Consider 
the call with X = 2 and N = 9: 


(Call from program) (Return to main program) 





Two features of this solution are worth noting. First, it should be clear that 
the new algorithm is much more efficient than either the first recursive algorithm 
or the non-recursive algorithm. For example, to find X°?, the other algorithms must 
perform 32 multiplications, while this algorithm need only do 7 (two when N = 1 
and one for each of N = 2,4,8,16 and 32). The efficiency comes from the fact that 
the algorithm reduces the size of the problem by half during each recursive call. 


Second, this algorithm is complex enough that recoding non-recursively 
(iteratively) is likely to be difficult and prone to crrors. In this case the recursive 
version is preferable to the iterative one. 





Review 


5. Carefully trace through the last Power algorithm and determine how many 
multiplications are required for the following values of N. 


a) 4 d)7 
b) 5 e) 8 
c) 6 f) 16 
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4. Given the declaration: 


type 
ArrayType = array [1..100] of integer; 
Write a recursive function with the following header which solves the problem 
posed above: 


function Sum (var A :ArrayType; N :integer) :integer; 
(* Pre: N is in the range 0..100 
Post: Returns the sum of all elements in A{1..N) *) 


Se ee E E E E E E a a 
A Better Power Function 


It should be clear that the Power function can be easily rewritten using an 
iterative solution (that is, a solution which does not require recursion, but instead 


uses a loop). For example: 


Temp := 1; 

for Factor := 1 to N do 
Temp := Temp * X; 

Power := Temp; 


Here we begin with a value of 1, and then multiply it repeatedly by X. For 
example, to find X® we multiply 


IEX EX EX*EX EX *X SHAR 


This accomplishes the same action as the recursive function, but may be a bit 
clearer and does not require the computer to keep copies of local values on the 
stack. In general, if a simple non-recursive function or procedure can be written, it 
is to be preferred to the recursive one. We wrote Power recursively only as an 
example. There are, however, some cases where the recursive version of a 
subprogram is better than the iterative one. 


Consider, for a moment, how you would solve a large exponentiation 
problem using only a calculator. If the problem were x5? it is unlikely that you 
would perform 32 multiplications. Instead, you would recognize that if X16 could 
be found it could be squared to produce the correct answer. Similarly, X!° can be 
found from X® and so on until the simple case (X°) is reached. This leads to the 
recursive algorithm: 


if N=0 
then return | as the answer 
else find X™/? and square it 


Before this can be turned into Pascal code, two modifications must be made. First, 
the algorithm is only useful with integer exponents, so the division must be 
performed using div. Second, the algorithm must handle cases where the exponent 
is odd. For example, to find x°’, we find X4, square it, and then multiply by one 
more factor of X. In Pascal, our new algorithm becomes: 


if N= 0 
then Power := 1 
else 
begin 
Half := Power(X, N div 2); 
if Odd(N) 
then Power := 
else Power : 


af * Hale rX 


H 
= Half * Half 


end; 
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(Call from main program) A (Return to main) 








Fib(N-1) 


Fib(N-2) 


Peds 


This diagram. also illustrates how inefficient this approach is for the 
Fibonacci problem. Even for this small case, we can see that Fib(2) is evaluated 
twice, and Fib(1) and Fib(3) once. This: becomes inefficient for larger values of N 
because of the number of redundant calculations and calls. 


Often multiply-recursive procedures and functions can be rewritten without 
the use of recursion and with a considerable improvement in efficiency. This is 


certainly the case for the Fibonacci function. An iterative solution is: 


function Fib (N :integer) :integer; 
(* Pre: N > 0O 
Post: Returns the Nth value in Fibonacci series. *) 


var 
Item1, Item2,-NewItem, ItemNum :integer; 
begin 
if N=1 
then Fib := 0 
else if N = 2 
then Fib := 1 
else 
begin 
Item1 := 0; 
Item2 := 1; 
for ItemNum := 3 to N do 
begin 


NewItem := Iteml + Item2; 
Item1 := Item2; 
Item2 := NewItem 
end; (* for *) 
Fib := NewItem 
end 
end; (* Fib *) 
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6. The most recent Power algorithm can be modified to divide by threes rather 


than by twos. For example, to find X°®, first find X? and then cube it. Write 
Pascal code to implement this idea. (Hint: You will need to consider three 
recursive cases.) Do you think this will be more or less efficient than the twos 


version? 


ooe a ee ee a cecum gm areas Se Sea 


Multiple Recursive Calls 


In each of the previous examples, cach call of a procedure or function 
generated either one or zero recursive calls. For example, Power(2,3) generated 
Power(2,2) and Power(2,0) did not generate any calls. Many of the more interesting 
problems that can be solved using recursion require the generation of more than 
one recursive call. 


A frequently used example of this "multiple recursion” is a function to 
generate members of the Fibonacci series. This series is a sequence of numbers 
where cach number is the sum of the two values preceding it. The first two values, 
which do not have two predecessors, are usually taken to be 0 and then a 1. The 
first few values are: 


O-1-12.2 3.5.3 13.2) 34.332 


A recursive function for finding the Nth number in the series can be easily 
constructed using the definition of the series: : 


function Fib (N :integer) :integer; 
(* Pre: N > 0 
Post: Returns the Nth value in Fibonacci series. *) 


begin 
if N=1 
then Fib := 0 


pos- : 
else Fib := Fib(N-1) + Fib(N-2) 
end; (* Fib *) 


Several things are of note here. First, there are two "simple cases" - the case of N = 
l and of N = 2. We need both because a call to Fib(3), for example, will generate 
both.the calls Fib(2) and Fib(1), and Fib(2) does not reduce to Fib(1). 


More importantly, we can see that each call to a non-simple case gencrates 
two recursive calls: Fib(N-1) and Fib(N-2). When the first call is encountered, it is 
executed and completed and then the second call is performed. When both calls 
have returned their values, the values are added together and the total returned as 
the result of the original call. The following diagram illustrates the call Fib(4): 
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Review 





9. Write algorithms for recursive functions which: 


a) Find the number of 0’s in an integer array. j : 
b) Find the average of the elements in an integer array (Be carefull). 
Hint: The sum of A[1].A[N-1] can be expressed as (N-1) * Average(A[1]..A[N-1]) 


10. Write recursive and non-recursive versions of the functions described in 
Review 9. 


A Complex Example of Recursive Processing 


Of greater value are recursive procedures which perform operations on 
structured data. In this section we wil! construct such a procedure, and in so doing 


create a new data type for ease in processing. 


Consider the problem of generating a list of all possible N-letter "words" 
consisting only of the letters A, B, and C. For this problem, a word is defined to be 
any combination of the three letters, whether it is a rea] word or not. One way to 
handle this recursively is to select a letter, and then append all of the possible N-1 
letter words. Of course, the N-1 letter words can be generated by selecting each 
letter and then generating all of the N-2 letter words. Eventually words with no 
letters appended will be generated. This is an easy task and forms the simple case 


of the recursion. 


The recursive algorithm is: 


if number of letters left to.pick is 0 
then print the word as is 2 
else for each of the possible letters : : 
append the letter 
print all words formed by recursively appending N-1 letters 
remove the added letter 


An actual implementation of this procedure requires a method of storing words. 
This is a good place to put the concept of an abstract data type to use. The 
algorithm requires four operations on words: initialization, appending a letter, 
printing a word, and removing a letter. The headers for the procedures to perform 


these operations are: 


procedure InitwWord (var Word :WordType) ; 


(* Pres none : 
Post: Word has been initialized to empty *) 


procedure PrintWord (Word :WordType) ; 
(* Pre: Word is a valid word 
Post: Word has been printed on the screen *) 


procedure Append (var Word :WordType; Ch :char); 
(* Pre: Word is a valid word with < MaxLen letters 
Post: Word is the original word with Ch appended *) 


procedure Truncl (var Word :WordType) ; ; 
(* Pre: Word is a valid word with at least one letter 


Post: Word is original word without last letter *) 


oh 
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This new version is a bit more difficult to understand, but is immensely more 
efficient. It covers the same two simple cases, and then uses a loop to calculate the 
other values by repeatedly adding the previous two numbers in the series to find 
the next. 

DE E ae eee et ee Se ee E E ee 
Review 

7. Determine the number of additions required by calls to the recursive Fib 

function for cach of the following values of N: 


a) 3 d) 6 
b) 4 e) 7 
Cj =: f) 8 


8. Repeat this problem for the non-recursive version of Fib. 
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Recursion With Structured Variables 


Most of the examples covered thus far have had simple data type values as 
their input and their results. More interesting recursive procedures and functions 
can be written which act upon structured variables, such as arrays. 


The most basic of these are functions which return some numerical property 
of an array. For instance, Reviews 3 and 4 describe a function to find the sum of 
all elements in an integer array from 1 to N. The recursive solution involves 
finding the sum from | to N-1 and then adding the Nth element: 


function Sum (var A :ArrayType; N :integer) :integer; 
(* Pre: N is within the range of elements of A 
Post: Returns sum of elements from A[1] to A[N] *) 


begin 
if N=. 0 
then Sum : 
else Sum : 
end; (* Sum * 
A function such as this is not difficult to write, but neither is the iterative 
version: 


function Sum (var A :ArrayType; N :integer) :integer; 
(* Pre: N is within the range of elements of A 
Post: Returns sum of elements from A[1] to A[N] *) 


= 0 
= A[N] + Sum(A, N-1) 
) 


var 

Temp, . (* Temporary sum value. *) 
' Index :integer; (* Index to element being summed. *) 
begin 

Temp := 0; 


for Index := 1 to N do 
Temp := Temp + A{Index]; 
Sum := Temp 
end; (* Sum *) 
Indeed, this second version may be clearer and is more memory efficient because it 
does not require the computer to keep local values on the stack as the recursive 
routine does. 
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Questions 6 - 10 are based on the following procedure: 
procedure Test (N :integer); are ee 


begin 
if N >O 
then Test (N-1); 
write (N:2) 
end; (* Test *) 


6. What output is produced by the call Test (5)? 


ay 0.2: 2.3. 4°5 
b543210 

eis 2 4. 5 

d) O 

e) No output is produced because the procedure has a syntax error. 


7. If the output from Test is only a single digit, which of the following is 
a possible original value for the parameter N? 


j ates 1) 
Hol 
Ill. -1 
a) I only. 
b) IT only. 
c) III only. . 
d) I and III only. 
e) I, Il, and III. 


8. Which of the following is the original value of the parameter N if the 
stack produced by Test is: 


2 


` 


(The top box represents the top of the stack.) 


a) 0 
b) 1 
e) 2 
d) -1 
e) This is not a possible stąck for Test. 
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Chapter AP 3 Test 


PART I Choose the best answer for each of the following questions. 


1. 





"Recursion" is 

a) the process in which a procedure or function calls another procedure or 
function. 

b) the process in which a subprogram calls any other subprogram. 

c) the only way to place values on a stack. 

d) the process in which a procedure or function calls itself. 

e) the only way to remove values from the stack. 


An iterative subprogram 


a) uses only loops, no recursion. 

b) uses only recursion, no loops. 

c) uses recursion within a loop. 

d) has more than 1 recursive calling statement. 
e) has only 1 recursive calling statement. 


When a recursively called subprogram terminates: 


a) any local values are placed on the stack. 

b) a set of local values are restored from the stack. 

c) any local values are written on the screen. 

d) any variable parameters are placed on the stack. 

e) any variable parameters are restored from the stack. 


To be solved recursively, a problem should: 


E be able to be expressed as a smaller version of the same problem. 
II. have a simple case which can be solved non-recursively. 
II]. require at least one variable parameter. d 


a) I only. - 
b) II only. 
c) III only. 
d) I and II only. 
e) I, II, and II. 


Recursion may be inefficient for simple subprograms because: 


j f of the space required to save local values on the stack. 
II. of the time required to save local vaiues on the stack. 
lll. recursive subprograms are usually longer than non-recursive ones. 


a) I only. 

‘b) II only. 

c) IIT only. 

d) I and H only. 
e) I, II, and HI. 





AP 3.44 AP Supplement Chapter Three 


PART II Write each of the following based on the following definition of 
ListType. z 


const : 
MaxLen = 100; 


type 
ListType = record 
Item array [1..MaxLen] of integer; 


Length :integer 
end; (* ListType *) 


a) Write a recursive function named CountOdds which returns the number of 
odd integers in a ListType variable. 


b) Write a recursive function named Sum which returns the sum of the elements in 
a ListType variable. 


c) Write a recursive procedure named WriteOut which priats all of the elements 
with an even index in ascending order by index, followed by all of the 
elements with an odd index in descending order by index. For example, the list 


1,2,3,4,5,6 would be printed as 2465,3,%. 


\ 
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` The Test procedure is changed to include a variable parameter, M: 


procedure Test (N :Integer; var M : Integer) ; 


begin 
M := N; 
af > 0 


then Test (N-1, M); 
write (M:2) 
end; (* Test *) 


9. Assuming M is declared as an integer, what output is now produced by 
the call Test (5, M)? - 


a) 0 

aT Se 2-3 4 5 
Cao 4. 3-2 1.0 
dJi 2345 
H0000 0O 


10. What is the last value of N placed on the stack by the call given above? 
a)-l 
b) 0 
Ca 
d) 4 
e) 5 


PART II Given the following multiple-recursive procedure: 
procedure MultRecur (N :integer); 


begin 
if N >-0 
then — 
begin 
if not Odd(N) 
then MultRecur (N+1); 
MultRecur (N-3) 
end; 
writeln (N) 
end; (* MultRecur *) 


Show the output produced by the call: 
MultRecur (2); 
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” or “Big O of n”. Sequential search of a sorted file is 
d file. However, time to sort a file, expected 
file must also be considered. For example, 
dering a file based on 


O(n}|—read “on the order of n 
generally better than of an unsorte 


number of searches, or other uses of the 
if certain key values are more likely to occur than others, or 
probability of key value occurrence may be superior to sorting the file. 


2. Binary Search 


ficient than sequential search. At each 


Binary search of a sorted file is more ef 
itially the positions of kı 


stage of binary search, the remaining file is split in half. In 
and k, are considered left and right end pointers set to left and right, respectively 
lie, left = 1, right = n). Then k, is compared to k, where j = ({left + right}/2). If ko 
= k; then the search is successful. If k, < k, then right becomes right — 1 and the 
next comparison is ma 


is found or until left > right. 
Algorithm: Searches a fle in nondecreasing order. 


de at ({left + right}/2). This process continues until the key 


if Left >Right 
then key is not in file 


else 
begin 
j= {las Right) div 2; 
then key is found 
else if ko < Ki : 
then search for k, between Left and (Right = (j — 1)) 
else search for k, between (Left = (i + 1)) and Right 
end; 


Note that the algorithm is recursive. An iterative non-recursive algorithm 


could also be developed. 


EXAMPLE 


a) k, = 25 
Assume key values are 2,7,9,10,25,28,40, and 45 


Then Left = 1, Right = 8 (the position of key values 2 and 45, respectively). 


Left Right 
| | 


27 9 10 25 28 40 45 


Left > Right is false. j = (1 + 8) div 2 = 4. (k, = ky = 10). k, > k; Left =j+1 
= 5, Right = 8. (The Ble is cut in two and Left is reset.) 


a. 
— 


| 
| 
[l 
[l 
| 
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CHAPTER 5 


Searching Algorithms 


Sorting and searching collections of data are standard techniques which exem- 
plify various kinds of algorithms and data structures. We will assurne that a collec- 
tion of data objects are in a file and that the file will be sorted or searched on a key, 
that is, a specified field of each data object. For example, the data objects may be 
personnel records and the key may be the Social Security number. In that case we 
will refer to sorting or searching the file of personnel records on the key Social 
Security number. In general, we will use the following notation: K = key, R = 
record, k, = the value of the key in the i® record, n = number of records in a file, k, 

- = given key value. ; ge 3 


1. Sequential (Linear) Search 


Searching a file sequentially involves reading each record m, rz .. . Tn iN order 
and comparing each k, (value of the key} with k, until k; = k, for some j = 1,..,n or 
until it is known that the key is not in the file. If the file is not sorted on K and k, 
is not in the file, then reading continues until end of file is reached. If the file is 
sorted and the key is not in the file, reading continues only until the first record 
for which k; > k, or k,’< k,, depending on whether the file is in nondecreasing or 
nonincreasing order on K. 


Analysis: 


For non-sorted files, sequential search may be the most likely choice of search 
method. The maximum number of comparisons (i.e., the worst case situation) isn 
and occurs when the key sought is in the last record or is not in the file. If all keys 
are equally likely, then the average number of comparisons is (n + 1)/2. 

For sorted files, the maximum number of comparisons is still n but its occur- 
rence is no more likely than if only one comparison were made. Thus an average 
search involves n/2 comparisons. Common notation for time proportional to n is 


30 oN 
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Note the recursive algorithm has at least one path which provides an exit 
from the sequence of recursive calls. Specifically, “if Left > Right then key is not 
in file” provides one path and “if k, = k; then key is found” provides another. Any 
recursive algorithm must have this characteristic, namely that the recursive se- 
quence must eventually reduce to at least one nonrecursive case. Otherwise the 


algorithm will not terminate. 


Analysis: 

Each time a comparison (k, vs. k;) is made, binary search divides the remaining 
file elements in half. Thus, after 1 comparison, a maximum of 1/2 the number of 
key values are left; after k comparisons, (1/{2*}) key values remain. The maximum 
number of comparisons to find a key value or to determine it is not in the file, 
then, is the value of k such that 2* = n, the number of key values in the original 
file. Thus k = logn and we say that binary search is an O(log:n) algorithm.’ 

To compare the approximate efficiency of algorithms which are O(n) and 
Q(log.n), consider the following table: 














O(n} O{log.n) 
l 
2 
3 s 
16 16 4 
1024 1024 10 
65,536 65,536 16 
262,144 262,144 18 


For large files, binary search is clearly more efficient than sequential search. 
There are, however, limitations on the use of binary search. Elements must be 
stored so that they may be accessed randomly, such as in an array. Binary search 
will not work with a (one way) linked list, for example, and is less effective in 
applications involving large numbers of insertions and deletions. Also, elements 
must be sorted on the same key as used in the search. Finally, a nonrecursive 
version of the binary search algorithm should be implemented for the greatest 
efficiency; recursive algorithms may make a solution process clearer but their 
implementations require large amounts of processing time. 


3. Hash-Coded Search 


Hash coding has the objective of using only one comparison or access [com- 
monly called a “probe”) to determine whether or not a key value is in a file. The 


algorithm consists of two main parts. 
a} A hash function f{{h) which associates an address (or a representation of an 


address such as an array index) with a key value. 


‘For a discussion of logs, bases and binary search see Appendix C. 
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Left Right 


t { 


: ‘27 9 10:25 28 40-45 


ii) Left > Right is false. j = (5 + 8) div 2 = 6 (k; = ks = 28). ko < k,. Left = 5, Right 
= j — 1 = 5 (The file is divided again, this time with Right reset.) 


Left Right 
H 


2-7- 9 I0 2 2-0 a 


iii) Left < Right is false, j = (5 + 5) div 2 = 5 (k; = kę = 25), ko = k; key is found. 


b) k, = 5 
Then Left = 1, Right = 8. 


Left Right 


—<—— 
<~ 


2 7 9:10 25 28 240-45 


i) Left > Right is false, į = (1 + 8) div 2 = 4, (k; = k, = 10). kẹ < k, Left = 1, 
Right = j — 1 = 3, |k, = ks = 9). : 


. Left Right 


—_ 
<< 


2 7 9 10 25 28 40 45 


ii) Left > Right is false, j = (1 + 3) div 2 = 2, (k, = kz = 7). ko < k, Left = 1, Right | 
=j—- = 1, (kj =k, = 2). 


Left Right 


ate 
—t— 


2-7 9 30 25-238 40 4% 


iii) Left > Right is false, į = (1 + 1) div 2 = 1, (k, = k, = 2). kə > k, Left =j+1= 
2, Right = 1. ; 


Right Left 


tt 


Z- 7 9 10° 36°26 8 


- iv) Left > Right is true. Therefore key is not in file. - 
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tions hh ...,hẹ are determined h; = (h, + j?) mod(array size). In the 
example, h, = 7, ho = 7 + 1° = 8. The key value 40 would be stored in A[8]. 
Had A[8] been occupied, the successive probes would be a h, = (7 + 2”)mod 
11 = 0, h; = (7 + 3*}mod 11 = 5, h, = (7 + 4’}mod 11 = 1, etc, until an 
open location is found. ite 


e} rehash 
choose a second and different hash function g. Initial probe h, would be at 
the address determined by the primary hash function f. If a collision oc- 
curs, use g on the key value to determine the next location h,. Successive 
locations h, are determined by h; = (j + g(k})mod{array size). . 


Note that in all of these techinques, a process called “clustering” can occur. ` 
Once collisions occur, sequences of them can be generated. For example, using 
constant displacement, if additional key values 29 and 62 were hashed, then 29 
would experience the same collision as 40, and 62 would experience the same 
collision as 40 and 29. The possibility of clustering increases as the ratio of occupied 
locations to array size increases (the load factor}. Hash functions and collision-han- ‘ 
dling techniques should be chosen to avoid clustering whenever possible. 


2) Chaining 


Chaining involves maintaining linked lists as a means of collision handling. 
Each array location A{j],*j = 0, . .. (array size — 1}, contains a pointer to a linked 
list of key values which hashed to j. Initially, the array locations contain nil, the 
pointer to an empty list. Each key value is then inserted (in order, usually) into its 
appropriate linked list. Ideally, short linked lists result. If lists become too long, 
the purpose of hashing is negated. Considerations-should be given to starting over 
with a different array size, a different hash function, or both, or a different search- 


ing method. ; 
EXAMPLE: Assume file key values are 10,28,2,7,9,42,25, and 40 and array 
size = 7. Then the results of using the division method with chaining are as 


follows: 
ee gees 
28 med a | min 
w [| | Ta) 
a | ol 
mo PE 2a 
45 mod 7 = 3 —>|40 | nil | 


25 mod 7 = 4 


40 mod 7 = 5 : ‘ gui ul 
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b) A collision-handling mechanism to store a key value when two or more 
values hash to the same address. 

Most commonly, an array is used to store key values (and their records or 
pointers to their records). Array indices, then, are the values determined by the 
hash function. Generally the array size is larger than the number of key values to 
be inserted. The hash function should scatter key values throughout the array as 
randomly as possible. The same hash function used to store elements must be 
used to search for an element. 

There are numerous hash functions possible and various collision-handling 
techniques. A common hash function, mod, is used in the division method. 

EXAMPLE: Assume the key values are 10,28,2,7,45,25, and 40. 

Because file size n is 8, we choose array size 11, a number larger than n and 

also a prime number. Then the array index is determined by k mod 11. 


2 
7mod1ll= 7 
9modll= 9 
1 
3 
7 


45 mod 11 = 
25 mod 11 = 


Array index 


Key value 





An attempt to insert key value 40 in A[7] causes -a collision—a key (< > 40) 
already is stored there. A collision-handling technique must be used. Common 
ones include: 


1) Open addressing 


a) constant displacement E 
Add a constant value, d, to the hash address. . 
Repeat until an open location is found. 


In the example, if d = 1, then probe at 7 + 1 = 8. Insert 40 in A{8}. If A[8] 
has been occupied by a key value < > 40, then add 1 to 8 and try to insert 
50 in A(9], etc. 


random displacement 


b 


— 


Call pseudo-random number generator to determine the displacement d. 


— 


c) linear quotient 


Use the quotient resulting from the division of k by the array size as the 
value of the displacement d. In the example, 40 mod 11 = 7, 40 div 11 = 3. 
Thus d = 3 and the second probe would occur at (7 + 3) = 10. Successive 
probes would occur at (10 + 3) mod 11 = 2, (2 + 3) mod 11 = 5. The key 
yalue 40 would be stored in A{5]. 


d) quadratic quotient 
Determine initial location h, by the division method, then successive loca- 
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9. Searching 


9.1 The Need for Search 


In this chapter we study search techniques. Searching is one of the most commen uses to which 
computers are put, Search is at the kernel of operating systems, Search is fundamental to modern 
techniques of problems solving, artiticial intelligence, and robotics. And, search has been introduced in a 
number of places in this book already. 


Everyday applications of searching abound, Computers are used to search for medical records when a 
person is admitted to the hospital. A computer is uscd to search the nation’s parts suppliers for 
transmission bearings needed to repair a truck. A computer is used to search for open scats when a 
person arrives for an airline flight. A computer is used to search credit records when a person applics for 
a loan. 


Operating systems have search as one of their fundamental tasks. They search secondary storage fora 
file that contains a program that a person wants to execute or modify. They search primary storage for 
free memory to be allocated to an executing program. They search user-identification and password files 
to help enforce system security. 


Problem solving involves searching a solution space. Artificial intelligence focuses on search to a 
staggcring degree. Just reading part of the table of contents of a recent and important text in artificial 
intelligence [5] includes depth first, breadth first, hill climbing, best first, alpha-beta, and AND/OR. And 
this doesn’t even touch beam, mini-max, best, means-ends and a host of others too numerous to mention! 


We have already used simple search techniques in this book. In Chapter 2 we taught a robot to search 
for beepers, walls, and doors. In Chapter 6 we searched arrays and arrays of records for specific items. In 
chapter 7 we searched linear linked lists. 


There are indeed a plethora of search techniques. They range from the mundane applications to the 
very esoteric. In this chapter we are concerned with only those techniques that are at the foundation of 
the discipline. ; 


9.2 Basic Concepts 


When we speak of searching in this chapter we mean that we are inspecting a collection of items for 
some particular item. Generally the items of the collection are referred to as records? We search a 
collection of records according to some search key. Suppose you are admitted to the emergency room of 
the local hospital. A hospital employee makes a search of the medicai records on file for your record, 
looking for medical history, allergies. and whether or not you pay your bills. Come to think of it they 
probably check the insurance that you carry first of all. They search for your name or possibly your social 
security number. The thing being searched for (i.c. the name or social security number) is called the 
search key. Both your name and social security number are items of information that are embedded in 


your medical record. We say that these are embedded keys. 


net 


ahia student should realize from knowledge of abstract data types that this docs not imply that scarch requires an 
implementation using Pascal records. 


Vile Date: January Ist 1985 at 22:42 Scribe Date: January 2nd 1985 at 00:36 
Vile Name: CHAPTER 12.MSS.66 line 2 Scribe Version: 4(1400) 





*» 


- 


36 


Review of Selected Topics 


Array values 1, 6, and 7 still contain nil. Only array values 0, 2, 3, 4, and 5 are 
needed as the dynamic allocation takes place. ‘ 

Note that chaining requires extra space for pointers but that dynamic alloca- 
tions of nodes in Pascal can alleviate some of the potential space problems. Also 
note that in the worst case, all key values will hash to the same address (array 
index} with the result that chaining reduces to maintaining a single linked list. 


Analysis: 


The number of comparisons (probes) required to determine whether an ele- 
ment is or is not in a hash table (an array constructed by hashing) could vary from 
one in an ideal case to n, the number required to perform a sequential search on a 
linked list. The actual number depends on many parameters, such as size of array, 


hash function, collision-handling technique, type of data, and amount of data. 


Once a reasonable data table is established, seaching should be very efficient. 


4, Searching an Ordered Binary Tree 


See discussion under “Sorting an ordered binary tree.” 
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9.4 The Log(n) Ordered Search | ? 


a relatively simple change. Hets look atan . 
ting that we first do is make sure that the 
ablishinig that the new 


We can improve on the performance of this algorithm with 
example. When we want to add a person to the payroll one U 
new employee person in not aiready on the payroll, We search the payroll, est 


employee isn’t already an employee. 


How many records must we inspect to be sure that the new employee isn’t on the payroll? Using 
sequential search we have to search the entire list of records! What if we knew something about how the 
elements were ordered in the list? Specifically, what if we knew the list was ordered in strictly increasing 
order? It turns out that this added knowledge enables a much faster search. If we find an embedded key 


which is larger than the search key, we need not examine any of the remaining embedded keys since they 
are all larger than the search key and therefore not possibly equal to it. 


The following function illustrates searching an ordered list 


FUNCTION OrderedSearch(list, key): INTEGER 


BEGIN 
index + 1 
WHILE (index < HumberOfElements) AND (listLindex] < key) DO 


index « index + 1 


“If (index < NumberOfElements) AND (listLindex] = key) 
THEN RETURN( index) (* return index of matching record *) 


à ELSE RETURN(0) (*.no match *) 
END s 
Once the search key has been compared with the embedded key, casper.the search terminates. This 
leads to a substantial savings in search time. " 


9.5 Binary Search 


The algorithm for ordered search cuts down search times for search keys that are not in the list. What is 
its effect for search keys that are in the list? Surprisingly it has no effect at all. Since a search key is just as 
likely to fall at the end of the list as at the beginning, search times for elements in the list are still O(n). 
We extend the idea of exploiting knowledge of the order of the embedded keys in this section. 


Once we come to an embedded key which is larger than the search key, we know not to examine any of 
the remaining embedded keys. Likewise, if an embedded key is smaller than the search key, we know we 
need not examine any of the ernbedded keys which precede it. By examining one embedded key ina 
ordered list, we can cither ignore all embedded keys before it or all embedded keys after it. 


If we commence our scarch from only one end of list, we only take advantage of one half of the 
information available. ‘Therefore we examine an embedded key which is at the middle of the list. After 
only one comparison we can ignore either the first half of the list or the last half of the list , but always no 
tess than half. Of course, if the embedded key at the midpoint of the list matches the search key, the 


search is over. 


The following pseudo code illustrates this type of search which is known as a binary search. 
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‘There are some commonalities to all types of search addressed in this chapter. First there is a data 
structure that is being, searched. There is a search key, a means of distinguishing the record that we want. 
There is an embedded key on each record, a way that cach record is distinguished from every other record. 
In this chapter we consider only those search algorithms which match a search key with an embedded key. 


9.3 Sequential or Linear Search 


Remember the payroll problem of that we discussed in Chapter 9? A company had a shoe box full of 
Suppose that the company wants to give one employee, Danicl Boone, a pay raise. It 
must search for Boone's record and then modify its contents. It is casy to abstract the properties of this 
kind of search. ‘There is a list of employee records, each having an embedded key, and there is a search 
key (Boone). ‘The technique is to compare the name ficld of the first record (embedded key) with the 
search key. If it matches then we have found the record. Ifit doesn’t match then we compare the name 
ficld of the following record with the search key. ‘This process continues until cither the record with 
embedded key matching the search key is found or the entire list has been inspected and there is no 


match. 


employee records. 


In this illustration the data structure being searched is the list. The embedded key is the name field of 
the record. ‘The search key is Boone and is compared to the embedded key. Abstracting this into pseudo 
code we consider the list to be implemented as an array, list{].. NumberOfElements}. ‘The search is a 
function which evaluates to 0 if no embedded key matches the key. Ifan embedded key does match the 
search key the function evaluates to the index of the record containing the matching embedded key. 


FUNCTION SequentialSearch(Tist, key): INTEGER 


BEGIN 


index + 1 > : 
WHILE (index <$ NumberOfElements) AND (VistLindex] \ key) DO 


index « index + i 


IF index < NumberOfElements 
THEN RETURN( index) (* return index of matching record *) 


ELSE RETURN(0) (* no match *) 
END 


The clements of the list may be in the order in which people were hired, they may be in the order of the 


data entry person’s preference, OF they may be in no particular order. The important point is that as a 
programmer we don't have any knowledge about where in the list the embedded key matching the search 
key might be found. Therefore, if the clement being searched for is present, it is equally likely in any 
position, If we are lucky. the search key will match the first embedded key. If we are unlucky, the search 
key could match the last embedded key. On the average we expect to find it in the middle, taking n/2 
comparisons where n is the number of clements in the list. If the element is not in the list, this algorithm 


requires n comparisons since we must examine every clement in the list to establish that it is not present x 
the number of elements 


Because the number of comparisons needed to find an item varies lincarly with 


in the list, sequential searching is an O(n) search technique. 


ee 
Vile Date; January Ist 1985 at 22:42 
Pile Name: CHAPTER 7.MSS.66 line 82 


Scribe Date: January 2nd 1985 at 00:36 
Scribe Version: 4(1400) 





Searching 445 


Consider the problem where the universal set of search elements has a relatively small number of 


elements like the 26 case-folded English letters. For this type of problem we. may use a vector of 26 
boolean values. Ifthe third element of the universal set, cis present, the third element of the vector has a 
value of true. If the e is not present the third clement of the array has a value of false. Because the 
cardinality of the universal set is small the following fragments illustrate how this works: 

i $ 


PROCIDURE Insert(koy) 


BEGIN. 
VistLord(key)] = TRUE 
END 


PROCEDURE Delete(key) 


BEGIN i 3 
TistLord(key)] = FALSE 
END 


PROCEDURE Search(key) 


BEGIN 
RETURN( 1istLord( key) }) 
END i 


The set of (case folded) English Ietter can be mapped into indices for an array of boolean values by 
using the standard functton ord. This is known as a bit vector representation. It has two requirements: 
All elements are ordered C}, Cy Cz =e Cn and the number of elements of the universal set are relatively 


small. 


It is clear that this bit vector scheme searches in constant time. The natural question is, can we search 
any collection of data this way? Unfortunately the answer is no. Every computer's memory is finite. 
Therefore, there is a practical limit on the size of the possible bit vector. Representing a set of case-folded 


letters was quite simple as there are only 26 of them. 


Say we wanted a sct of English words instead ofasct of letters. Although words satisfy the requirement 
of an ordering (words may be alphabetized), they do not fulfill the second one. It makes no difference 
even if we only want to store just a few words. The constraint is on the universal set and not the actual 
search set Even if we limit the words to 6 letters or less, the universal set has over 320 million 
possibilities. So you see, it is quite casy to exceed the capacity of a computer with moderate amount of 
memory. Unfortunately most applications have universal sets which are too large for a bit vector 


representation on typical computers. 


We can extend the bit vector idea to handle search spaces that have large universal sets. Suppose we 
need to store 26 words of the 320 million possibilities. You might think that we could usc the first letter 
‘of cach word for an ordinal value. However, if some or all words have the same first letter, this won't 
work. What we need is a function which takes the embedded key as input and returns a unique table 
index as output. When adding data to the table, we simply apply this function to the embedded key to 
determine the index where to place the data. Similarly when search for data, we apply function to the 


scarch key to determine the index where to look. 
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FUNCTION BinarySearch({ lowerbound, upperbound, key): indox 


BEGIN 
midpoint (lowerbound + upperbound) DIV 2 


IF list{midpoint] = key 
THEN RETURN(midpoint) 
ELSE IF list[midpoint] < key 
THEN BinarySearch(midpoint+t, upperbound, key) 
ELSE BinarySearch(lowerbound, midpoint-1, key) 


END 


‘The binary search is much more efficient than the sequential search or ordered search. An illustration 
will help you sec this. 


Suppose that we are searching the Pittsburgh phone directory for a search key, Farnsworth. We'll 
consider the sequential and ordered search algorithms first. There are about 500,000 listings in this 
directory. ‘The first step using the sequential or ordered scarch establishes clearly that Farnsworth is not 
the first embedded key. This leaves 499,999 embedded keys to inspect. ‘Ihe second step establishes that 
Farnsworth is not in the second position, reducing the list to be searched to a mere 499,998 embedded 


keys. Clearly this process can take a while. 


Now think about scarching the directory using the binary search algorithm. This first comparison 
establishes that the search key, Farnsworth, comes before the embedded key Lahey (say Lahey is the 
middle element in the directory). Every clement following Lahey is eliminated from the search. We are 
now searching a list of only 250,000 clements! ‘Ihe second compare establishes that the search key 
follows Dravo, reducing the list to be searched to those records falling between Dravo and Lahey. This 
cuts the list to be searched down to 125,000 elements. In fact, we know it will take no more than 18 


< comparisons, to either find Farnsworth or determine that Farnsworth is not one of the 560,000 embedded 


keys present. 


‘The analysis of the binary search algorithm is revealing. In the best case, the embedded key at the 
midpoint is the one we are searching for, requiring only one comparison. In the worst case the embedded 
key that matches the search key is first on the list or last on the list or it’s not even present. In this case we 
keep dividing the list by 2 until the list only has one clement in it. This requires log,(n) or O(log(n)) 
comparisons. This is an order of magnitude better than the other search techniques that we have seen. 
The average case for binary search requires O(log(n)) comparisons as well. 


9.6 Bit Vectors and Hash Search 


At this point we understand how much better binary search is than simple sequential search. The 
natural question is, can we do any better? We aren't interested in a little bit better, finding algorithms 
that are of order O(log(n)) and a smidgen faster than binary search. What we are concerned about is the 
existence of algorithms that are as much better than binary search as binary search is better than 


sequential search. 


An algorithm that searches att order of magnitude better than binary search would search in constant 


time! It doesn’t seem possible that a search could converge in about one step, regardless of the cardinality 
of the data being searched. This would indeed be a very fast search. In this section we explore just such 


an algorithm! 
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Another collection of techniques include increasing table size as the table gets “fuller”. Still others 


include moving to the next empty hash address, while others include rehashing and double hashing 
techniques. Chaining is very simple and usually works just fine. 


9.7 Binary Search Trees 
As we showed in Chapter 10, trees are an excellent way to represent data. Binary trees (cach node has 2 
or fewer children) are commonly employed for searching. To be of use cach no 


property: every node is in the lefichild has a smaller key than the node and every node in the rightchild 


has a larger key than the node. As before, when we watch fora particular key, we compare the search key — 


with the node. If the search key is smaller we search the left child. If the search key is larger we search 
the right child. If the search key equals the node's key, the search is complete. The following pseudo 


code illustrate the algorithm: 
FUNCTION BinarySearch(tree, searchkey) : node 


BEGIN 
IF key(node) = searchkey , 
THEN RETURN(node) 


ELSE IF key(node) < search key 3 ` 
THEN BinarySearch(rightchild(node), searchkey) 


ELSE BinarySearch(leftchild(node), searchkey) 
END : : 


The similarity with the binary array search given before should be obvious. If the binary tree is 
balanced, the two are equivalent. Recall that inserting sorted (or nearly so) data into a binary tree leads to 
an unbalanced tree, so you can’t simply apply this technique without thinking. 


9.8 Summary of Searching Algorithms 


The following table illustrates the performance of the search techniques presented in this chapter. 
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‘The rule that this function plays is entirely analogous to the role that ord played in our example of 
building a search table and searching the bit vector. ‘The name for the family of functions which performs 


these mappings is the Hash functions. 


We are going to build a table that is much like the bit vector. The table is called the Aash table and is 


_ implemented as an array. The universal set is much too large to have an element of the array dedicated to 
~ each clement of the universal set. However, we only expect to have a very small subset of the universal 


sct appear as elements in our hash table. We reserve many more elements for the hash table than we 
expect to put into the table. Then we select a hash function that spreads the keys out across the hash table 
as much as possible. The questions are twofold. ‘The first is, how big should the table be? The second is, 


what is the magic function? 


Say the table holds M records and is indexed from 0 to M-1 inclusive, then the hash function obcys the 


following relation: 
0 < Hash(key) < M 


What is wrong with this idea? If the number of records to store in the table js n and n > M, there is 
more data than there are positions to store them. ‘Therefore, an obvious requirement is that the table is at 
Icast as large as the number of records to be stored: - 

(M 2 n) 


Note that these two requirements do not logically imply that no two keys have the same hash value. 
"The conditions are necessary, but not sufficient to insure that no two keys collide (have the same hash 


. code.) Suppose we have a hash table with 365 positions (M =365), and we have 23 keys (n=23). If each 


key is equally likely, the probability of at least one collision exceeds 50%! Therefore, we sce that 
functions which avoid duplicate valucs are surprisingly rare even with a relatively large table. This 
situation is isomorphic to what is known as the "birthday paradox" which states that if as few as 23 people 
get together, chances are good that at least 2 of them have the same month and day of birth. You might 


try to verify this fact empirically at the next large party you attended. 


Therefore, even if n < M. even much less than M, we still cannot insure zero collisions. Moreover, 
collisions are highly probable in addition to being logically possible. So we see that there is not perfect 
function, and tables will never be big enough! The problem becomes instead, what to do about the 


collisions? 
When using a hash table we must decide on two things: 
1. hash function to us¢ 
2. method of collision resolution 


‘Two types of hash functions work well. Onc is based on division and the other is based on 
multiplication. “The division scheme is the simplest and given below 
hash(key) ::= key mod M 


_ The best choice is for M to be prime and therefore odd. 


collision resolution: ‘The most common collision resolution technique is called chaining. We simply 
chain together any colliding record from the table. The following diagram illustrates this structure; 
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Se a eal 


Directions: For each of the following decide which is the best choice. 


1. Searching an unordered set of data requires a statement with 


I div which divides the interval in half. 
II. a Boolean variable to test if an element is found. 
Ill. a variable of type Pointer to traverse the list. 


(A) 1 

(B) Il 

(C) Il 

(D) I&Il 
(E) 1, 1 & Il 


2. A characteristic of the data that binary search uses but the linear search 
ignores is the 
(A) length of the list 
(B) order of the list 
(C) maximum value in the list 
(D) number of dimensions of the list 
(E) mean of the data values 


3. The order of magnitude of the worst case performance of the linear search is 
(A} Nlog.N ; 
(B) N? 
(C) N/2 
(D) LogıN 
(E) N 


4. The order of magnitude of the worst case performance of the binary search is 
(A) NlogıN 
BN 
(C) N/2 
(D) Log.N 
(E) N. 


5. The order of magnitude of the worst case performance of the search of an 
ordered binary tree is ; 
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Table 9-1: Order of Number of Comparisons 





Structure/Algorithm Search Sea Insert Delete 
average re average average 
unsorted array n e c n 
sorted array n n n ve 
binary array search log(n) log(n) n n 

bit vector c c .¢ x c 
hashing e ‘S22 c c 
binary scarch tree log(n) n log(n) . log(n) 


Note that inserting and deleting for the worst casc has the same order analysis as the average case 
except for hashing and binary search tree. In worst case it is O(n) for deleting for hashing and O(n) for 


inserting for a binary search tree. 


9.9 Exercises 
_1. Write a Pascal program which implements the algorithm for sequential search. 


2. Write a Pascal program which implements the algorithm for ordered sequential search. ` 


3. Write a Pascal program which implements the algorithm for binary array search. 
4. Write a Pascal program which implements the algorithm for bit vector. 
5. Write a Pascal program which implements the algorithm for hashing. 


6. Write a Pascal program which implements the algorithm for binary tree search. 


- 
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(C) for all cases except when A[N] equals X 
(D) for all cases except when A[1] equals X 
(E) under no conditions ; 


A search procedure which associates an address with a key value and provides 
a mechanism for dealing with two or more values assigned to the same 


address is called 

(A) linear search 

(B) binary search 

(C) linked lists with pointers 
(D) hash coded search 

(E) radix search 


The following representation of a data structure 
sl peL Me 
ak plal. 
Eee 


o 
eTa 


Pas 
5 |m 

would result from using key file values 4, 9, 10, 12, 16, and 20 and a hashing 

function of ; 








(A) mod 5 
(B) mod 6 
(C) div 5 
(D) div 6 
(E) log:6 


When key values are not reals, a similar data representation might be pro- 
duced by using a hashing function with 

(A) mod 

(B) div 

(C) trunc 

(D) log:N 

(E) none of the above 


The order of magnitude of the worst case performance of a hash coded search 
is 

(A) N? 

(B) Nlog,N 

(C) N 

(D) log.N 

(E) not dependent upon N. 
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Multiple Choice Questions 


(D) Log,N 
(E) N. 


Use this program segment to answer questions 6 and 7. 


10. 


Large := AÍ[1]; 
for ]:= 2 to N do 
i. 


Large := A{I]; 


To search A for the largest of its N values the missing lines could be 
(A) if A[J] < Large then 
(B) if A[7] > Large then 
(C) if Large > A|I] then 
(D) if Large < A{I| then 
(E) if Large > A[N] then 


This program segment, when completed, is an example of 


(A) hashing 

(B) linked list search 
(C) binary search 
(D) linear search 

(E) merge search 


Searching an unordered sequential list when the values are randomly distrib- 
uted through an array will produce a match on the average at position 

(A) N 

(B) N/2 

(C). N? 

(D) Nlog.N 

(E) log.N 


A maximum number of passes for a binary search of 500 items is 
(A) 500 

(B) 100 

(C) 10 

(D) 2 

(E) 250 


The following program segment 
AIN + 1] := X; 
Position := 1; 
while A[Position] < > X do 
Position := Position + 1; 


. can be used to search a list of length N for the position of X 


(A) for all values of N 
(B} for all values except N = 0 
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20. Given the values of 3, 8, 12, 17, 19, and 20 in an ordered list, the value 17 will 
cause Found := True 
(A) on the 1* pass through the loop 
(B) on the 2™ pass through the loop 
(C) on the 3” pass through the loop 
(D) on the 4" pass through the loop 
(E) under no conditions 


7 | 
B 
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Multiple Choice Questions 


Use the following program segment to answer questions 15-20. 


Left := 1; 
Right := N; 
_ Found := false; 
while (Left <= Right) and not Found do 
begin ; 
Position := (Left + Right) div 2; 
if Value < AlPosition| then 
Right := Position — 1 
else if Value > A[Position] then 
Left := Position + 1 
else 
. Found := true; 
end; 


15. This program segment illustrates 
(A) a binary search to find the position for Value in A 
(B) a linear search to find the position for Value in A 
(C) a binary sort to place Value in A 
(D) a linear sort to place Value in A 
(E) none of the above 


16. When Found := true then ` 
(A) Position tells where Value is 
(B) Left tells where Value is 
(C) Right tells where Value is 
(D) A{Position] tells where Value is 
(E) Value tells where Value is 


17. If Value is not Found then 
(A) Position tells where Value should go 
(B) Value tells where Value. should go 
(C) Left tells where Value should go 
(D) A[Position] tells where Value should go 
(E) none of the above is true 


18. When Value is larger than all values already in the list 
(A) Left := N + l; 


(B) Left := N; 
(C) Left := N — l; 
-(D) Left := 1; 


' (E) Position := (Left + Right) div 2; 


19. The approximate search interval after one unsuccessful pass when 
Value < A[Position| 


will be 

(A) Al1| to AIN] 

(B) A[1] to A[N/2| 
(C) A[N/2] to AIN] 
(D) A[1] to A[N/4] 

[E] none of the above 
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7. What is the maximum number of elements that would be checked by a 
binary scarch on a 15 element list? - 


a) 2 
b) 4 i 
c) 6 
d) 10 
e) ll 


Questions 8-14 are based on the following list of names: 


1 2 3 4 5 6 z 


The selection sort with exchange as described in the text is used to sort the list in 
ascending (low to high) order. 


8. Which two names would be exchanged first by the selection sort? 


a) Amy and Bud 
'b) Amy and Pat 
c) Bud and Sam 
d) Sam and Jon 
e) Sam and Nan 


9. Which two names would be exchanged second by the selection sort? 


a) Bud and Sam 
b) Nan and Pat 
c) Max and Sam 
d) Sam and Jon 
e) Sam and Nan 


The list is now in order and the binary search as described in the text is used to 
search for "Bob": 


1 2 3 4 5 6 7 


10. What is the second name checked in a binary search for "Bob"? 


a) Max 
b) Pat 
c) Amy 
d) Bud 
e) Jon 


1l. Whatis the third name checked in a binary search for "Bob"? 


a) Sam 
b) Pat 
c) Amy ; ; = 
d) Bud 
e) Jon 
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Chapter AP 2 Test 


PART I Choose the best answer for each of the following questions. 
Ñ 


"Searching" is 

a) a method of proving that a program has no bugs. 

b) the process of checking a group of data for a specific value. 

c) the process of testing a subprogram with a wide variety of data. 

d) using an array to store information about a group of data. 

e) looking at each valuc in an array after the execution of a subprogram 
has finished. 


An "iteration" is 

a) one complete execution of the body of a loop. 

b) a process used only by the selection. sort. 

c) an unexpected error condition. : 

d) the process of reading (or writing) a record to a file. 

e) a group of comments which describe the input, output and actions of a 
program or subprogram. 


When a group of records is sorted based on the value stored in one field 
of the record, that field is called: 


a) the pivot field. 

b) the target field. 

c) the object field. 

d) the key field. ; 

e) No special name is given to the field. 


To use a linear search, an array must: 


E be sorted. 5: 
II. have valid elements stored in positions 1..Length. 
III. have a Length equal to a power Gt-2; é 


a) I only. 

b) II only. 

c) IHI only. 

d) I and II only. 
e) II and III only. 


. To use a binary search, an array must: 


I. be sorted. 
Il. have valid elements stored in positions |..Length. 


Ill. have a Length equal to a power of 2. 


a) I only. 

b) II only. . 

c) III only.” 

d) I and II only. 

e) I, II and III. - aa 


What is the maximum number of elements that would be checked by a 
linear search on a 10 element list? 

a) 3 ; aa a 

b) 4 

cys 

d) 10 

e) ll. 


¢ 
pee 
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18. The best sorting algorithm to use is: 


a) the selection sort. 

b) the selection sort with exchange. 

c) the insertion sort. ie 

d) All three sorts are equally efficient. 

e) The best algorithm depends on the particular application. 


19. What is the result of the following calculation? 
(5 + 7 div 2) 


20. What type of subprogram would most likely contain the following code? 
Middle := (Low + High div 2); 


a) a binary search 

b) a linear search 

c) an insertion sort 

d) a selection sort 

e) All of the above would contain a similar calculation. 


PART II : $ A 

a) Perform an insertion sort using box diagrams and a temporary list on the 
following list of data: 45, 16, 27, 33, 82, 51, 22. Show the contents of the 
temporary list as each item is entered. 


b) Show the elements checked when a binary search is applied to the above sorted 
list for each of the following search values: 
Ti: 
45: 
Zk 


PART III Given the following definition of ListType 


ListType = record 
Item :array [1..MaxList] of InfoType;. 
.Length :0..MaxList ; 
end; (* ListType *) ; 


write each of the following using good programming style. Include pre- and 
postconditions in each subprogram. on 


a) Write a Boolean function named Sorted which returns TRUE only if its 
ListType parameter is sorted in ascending order. Consider an empty list as not 
sorted. 


peo 
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13, 


14. 


15. 


16. 
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. What is the minimum number of names that would be checked by a 


linear search of the list? 


a) 1 
b) 2 
c) 3 
d) 4 
e)7 


What is the minimum number of names that would be checked by a 
binary search of the list? 


a) 0 
b) 1 
€)-2 
d) 3 
e)4 


An insertion sort is used to place the name "Bev" in the list shown above. 
What is the index value of "Nan" after the sort terminates? 


a) 3 
b) 5 
c) 6 
d)7 
e) 8 


A general program which is used to check different versions of a 
complex subprogram for validity is called a: 


a) "wind tunnel" program. 

b) exhaustive use program. 

c) normal use simulator. 

d) "testbed" program. 

e) No special name is given the program. 


The following code fragment 


while (Pos < MaxList) and (List[Pos] <> Target) do 
Pos := Pos- 23... j 


would most likely be found in a subprogram which performs a: 


a) linear search. 

b) binary search. 

c) selection sort. 

d) exhaustive test of a sorting procedure. 
e) postcondition visual check. 


Given a large set of ordered data, the most efficient search is: 


a) the linear search. 
b) the binary search. 

c) the selection search. 
d) the exchange search. 
c) the merge search. 
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- 12-23. Assume you have the following data: 4 2 5 3 1, and that you want to place it in 
descending order from left to right. If you use the bubble sort, how many 
exchanges would you have to make before the list is sorted? 

a. 3 c: 6 
b. 4 fa! d. 7 


12-24. The major deficiency of the quicksort is 
a. it uses a large amount of memory space 
b. itis not efficient when dealing with large amounts of data 
c. determining the pivot point 
d. it tends to be error prone 


12-25. In one pass of a bubble sort, comparisons need to be made. 
an C. a8 
b. n+1 d. n-1 


12-26. What does it mean when a bubble sort makes an entire pass without making an 
exchange? 
a. The bubble sort contains an error in its code. 
b. The list is sorted. 
c. There is an error in the input data. ` 
d. Memory space has been exceeded. 


12-27. Ifa list has 40 elements, what is the maximum number of comparisons needed to sort the 


list using a quicksort? 
a. 40 c. log 40 
b. 80 d. log2 40 


12-28. Looking through a list for a particular element after the list has been sorted in a specified 
order is referred to as š ; 
_ a, seeking c. searching 
'b, locating d. finding 


12-29. How does the sequential search work? 
a. The list is repeatedly cut in half until the desired element is found. 
b. Starting from the middle of the list, each consecutive list element is examined until 
the desired element is found. 
c. Starting from the end of the list, each consecutive list element is examined until the 
desired element is found. 
d. Starting at the beginning of a list, each consecutive list element is examined until the 


desired element is found. 
12-30. Sequential search is also referred to as searching. . 
a. linear c. differential 
b. straight d. unary 


12-31. Which of the following statements about sequential search is false? 
a. The list which is to be examined need not be in any particular order. 
b. When dealing with large amounts of data, it uses a greal deal of CPU time. 
c. The smallest number of searches is one. 
d. The maximum number of searches for a list of N elements is N/2. 


. 


G 


Ex 
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12-14. How does a bubble sort work when sorting elements in an array in either an ascending 


or descending order? 
a. Adjacent values in an array are compared and if they are not in sequence, they are 


exchanged. 
b. Alternate values in an array are compared and if they are out of sequence, they are 


exchanged. 
c. The list of elements in an array are continually divided into sublists until there is only 


one element in each sublist. 
d. The list of elements are divided into two lists. The adjacent elements of each list are 


then compared and exchanged if they are out of sequence. 





12-15. One of the fastest sorting methods is the sort. 
' a. merge c. bubble 
b. quick d. shell 
12-16. Assume you have a list of 1000 integer numbers to sort and place in ascending order. 
The most efficient sort routine for this task is the sort. 
a. merge. - c. bubble 
b. quick d. binary 


12-17. Assume you have a list of five integer numbers: 25 47 8 which you want placed in 


ascending order. The best sort routine for this task is the sort. 
a. merge Ce bubble _ : 
b. quick d. straight es. 


12-18. If two lists are to be merged, which of the following must be truc? 
a. The two lists must be of the same length. z 
b. They must not contain any repeated elements. 
c. The two lists must be of different lengths. 
d. They must be sorted. 


12-19. When performing a merge sort, the maximum number of comparisons to be made is 


a. the product of the lengths of the lists 

b. half the sum of the lengths of the lists 

c. the square of the number of items in the lists 
d. the sum of the lengths of the lists 


Assume you have the following data: 86543 1, and that you want to place it in 
ascending order from left to right. If you use the bubbie sort: 


12-20. How many comparisons would you have to make before the list is sorted? 


zS ¢, 18 
b. 14 d. 16 
12-21. How many exchanges would you have to make before the list is sorted? 
a. e cadis 
b. 14 d. 16 


12-22. Using the same data as above, but, using the quicksort, how many exchanges 
would you have to make before the listed is sorted? 


a. 4 Cc. 
b. 5 Pee 
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12-38. Use the bubble sort for the following list: 45312. Which of the-following is the 
correct sequence of steps to complete pass 1? 


a. 45312 c. 45312 
43512 ; 45312 
43152 43512. 
41325 43152 

43125 

b. 45312 d. 45312 
45312 45123 
43152 41235 
43125 12345. 


2 
gS 


Use the following programs to answer questions 39 
Assume the following declaration and that List is alr 


£ 
Q. 


y 


program BubSort (input, output); 
const 
Num = 10; 


type 2 
Arrayl = array[1..Num] of integer; 


_ var 
Exch : boolean; 


I. procedure BSort (var List : Array 1); II. procedure BSort (var List : Array1); 
s var . ` 


var 
Temp, J, NumPass : integer; Temp, J, NumPass : integer; 
begin (* BSort *) ; begin (* BSort *) 
repeat at 
Exch := false; Exch := false; 
’ NumPass := Num- 1; NumPass := Num - 1; 
for J := 1 to NumPass do for J := 1 to NumPass do 
PK begin begin 
if List[J] < List{J+1] if List{J] > List{J+1] 
then : —- hea 
begin 
List(J] := List[J+1]; List{J] := List(J+1); 
List{J+1] := Temp; List{J+1] := Temp; 
Exch := true; Exch := false; 
end; (* if *) end; (* if *) 
end; (* for *) end; (* for *) 
Num := Num -1; 5 Num := Num -1; 
until Exch = false; until Exch = true; 


end; (* BSort *) end; (* BSort *) 


b 

| 

b 

b 

| 

i 

i} 

i 
i- 

i 

| 
Temp := List; Tite ast: | 
l 

i 

i 

| 

i 

4 
a, 





12-32. 


12-33. 


12-34. 


12-35. 


12-36. 


12-37. 


How does binary search work? 5 

a. Starting from the middle of the list, each consecutive list element is examined until 
the desired element is found. ; 

b. Starting at the beginning of the list, each consecutive list element is examined until 
the desired element is found. 

c. The list is repeatedly divided in half, with the part not containing the target element 
being discarded. This process continues until the desired element is found. 

d. The list is simultaneously searched from the end and the beginning of the list. 


If a list has 50 elements, the maximum number of elements that needs to be examined in 
a binary search is 
a. 50 c. log2 50 
b. 100 Pe 





Which of the following comparisons between the binary search and sequential search is 

true? 

a. Ina binary search, the list need not be sorted; in a sequential search, the list must be 
sorted. 

b. As the size of a list grows, it becomes more advantageous to use the binary search 
over the sequential search. 

c. The maximum number of elements to be examined for a list of N elements using the 
binary search is N; using the sequential search, it is log2N. 

d. The binary search requires more CPU time than the sequential search. 


a 


The number of comparisons needed in a binary search grows , while the | 
number of searches needed in a sequential search grows ; 
a. logarithmically, linearly c. logarithmically, exponentially 
b. linearly, exponentially . d. mathematically, linearly 
The Towers of Hanoi is an example of using a : 

~a. sequential search c. looping construct 
b.` quicksort i d. recursive solution 


‘ What would the partitioning sequence look like if the following numbers were sorted in 


ascending order using a quicksort? 45312 


a. 45312 : c. 45312 
21354 | 41352 
12345 21352 

12345 

b. 45312 d. 45312 
25314 25314 
12345 . 21354 

12345 
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III. procedure BSort (var List : Array1); 


var 
Temp, J, NumPass : integer; _ 
begin (* BSort *) 
repeat 
Exch := false; 
NumPass := Num - 1; 
for J := 1 to NumPass do 


begin 
if List[J] > List[J+1] 
then 


begin 
Temp := List[J]; 
List{J] := List(J+1); 
List{J+1] := Temp; 
Exch := true; 
end; (* if *) 
end; (* for *) 
Num := Num -1; 
until Exch = false; 
end; (* BSort *) 


IV. procedure BSort (var List : Array 1); 


"Temp, J, NumPass : integer; 
begin (* BSort *) 


repeat 
Exch := false; 
NumPass := Num - 1; 
for J := 1 to NumPass do 
begin 
if List{J] > List{J+1] 
then 


begin 
Temp := List[J]; 
List[J] := List[J+1]; 
List{J+1] := Temp; 
Exch := false; 
end; (* if *) 
end; (* for *) 
Num := Num -1; 
until Exch = false; 
end; (* BSort *) 


12-39: Which of the bubble sort procedures above will sort List in ascending order? 
a. I . 


b Me 


c. Til 
d- D 


12-40. Which of the bubble sort procedures above will sort List in descending order? 
E ; 
d. IV 


a4 
b. Il 


animem ez eŢvu 
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EXAMPLE: Assume 1" pass:n = 8, i = 8, Large = A[l] = 10, k = 1. 







A[6] = A[8] 
A[8] = Large = 45 ¥ 


The 1* the end 


-of the file. 
File after one pass is: 10, 28, 2, 7, 9, 40, 25, 45 


pass finds the largest element, 45, in A[6] which it switches to 


2™ pass: i = 7, Large = A{l] = 10,k =1 


A[6] = A[7] - ri D ee 
A[7] = Large = 40 


The largest of the reduced array is 40 in A[6] which 


ich is switched to A[7f 
File after second pass is: 10, 28, 2, 7, 9, 25, 40, 45 oe rien 


3" pass: i = 6, Large = A[l]=10,k =1 


A[2] = A[6] 
A[6] = Large = 28 


File after third pass is 10, 25, 2, 7, 9, 28, 40, 45 
> fourth 7- “119, 2 7, 9, 25, 28, 40, 45 


Oe eS 9, 2, 7, 10, 25, 28, 40, 45 
Mt Bee See e 7, 2, 9, 10, 25, 28, 40,45 
DRN seventh(last) 2, 7, 9, 10, 25, 28, 40, 45 
Analysis: 


_ The number of comparisons during the first pass is n 


— 1. For each successive 
pass, the number decreases by one. The total number 


of comparisons, then, is 





‘oom os oe m oe os oe os os oF os oe os 
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2™ pass: k = 2. Temp = A[2] = 28, i = 1, Found = false. 
Temp < A[1] 
Found = true 
A[2] = Temp = 28 





(sorted) . 


3" pass: k = 3, Temp = a 2, i = 2, Found = False. 
Temp < A[2] : 
_A[3] = AQ] = 28,i=1. - 
Temp < A{l1] 
A[2] = A[1] = 10, i = 0 (Exit while loop) 
_ Ali + 1] = A[]] = Temp = 2 ` 





(sorted) 


4" pass: k = 4, Temp = an = 7, i= 3, Found = false. 
Temp < A[3] s 
A[4] = A[3] = 28,i = 2 
Temp < A[2] 3 
A[3] = A[2] = 10,i= 1. 
Temp < > A[l] . 
Found = true (Exit while loop) 
Afi + 1] = A[2] = Temp = 7 








4 
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then begin (*interchange*) ` 
Interchange := true; 
Temp := Afi]; rg EES 
Ali] := Ali + 1]; °° al 
Ali + 1] = Temp 
end; : 
Passnumber := Passnumber + 1; i 
end : 





=a" past: 
Interchange = true, Passnumber = i 
Interchange = false 2 
i= 1 Aji] > A[2] > noes 
i = 2 A{2] > A[3}, AP DN EE = true. Exchange 28 and 2. 


- i= 3 A[3] > A[4], Interchange = true. Exchange 28 and 7. -: 


i = 4 A[4] > A[5], Interchange = true. Exchange 28 and 9. 
i = 5 A[5] > A[6] 

i = 6 A[6] > A[7], Interchange = = true. Exchange 45 and 25. 
i = 7 A[7] > A[8], Interchange = true. Exchange 45 and 40. 
Passnumber = 2 





2m pass: 
Interchange = faleni an l Ea 
i = 1 A[1}] > A[2], i = true. pakk 10 indi 2. fj 
i = 2 A[2] > A[3], Interchange = true. Exchange 10 and 7, . 
i = 3 A[3] > A[4], Interchange = true. Exchange 10 and 9, x 


i = 4 A[4] > A[5] 
i = 5 A[5] > A[6], Interchange = true. Exchange 28 and 25. k 
i = 6 Al6] > A[7}. i ene 


Passnumber = 3 5- f F 


Array remains the same. No interchanges oecusted: Baits outer saehile loop because 


Interchange = false. 
Analysis: = : s 5 rae 
In the worst case, there are n — 1- passes and A ke pass involves n`— k 
comparisons. Thirelore, there are (n-1)+(n-2)+...4+1 = l= Hin —2) 


comparisons and bubble sort is O(n’), A more detailed analysis would show that in 
general there are fewer comparisons, but that the number of interchanges can add 





2 


E 
f 
CHAPTER 6 


Sorting Algorithms | | 


1. Selection Sort 


The objective of a selection sort is to place successive key values into their 
proper position. At the end of the first pass over all key values, the largest (or 
smallest) value will be in its position. After the k® pass, the k™ largest (smallest) 
value will be in its position. One example of selection sorts, coleg straight selec- 
tion sort, follows. 

Algorithm: Selection sort for a file of n key values. Sorts i into aonde: 
scending order in an array A. The indices of A are i, j, and k; Large is an integer. 


begin 
for i := n downto 2 
do begin ("place largest number of All] through Afi] into Large and its index 
into k*) 
Large := Afl]; 
k:=1 ; 
for j := 2 to i do 
if A{j] >. Large 
then begin 
Large := Ali); 
k:=j; . 
end; 
- A[k] := Ali]; (*place large into position i*) 
Ali] := Large; 
end 
end; 


e [iio EnEn ince | 
ho [as [a pro [as [25] a0 | key values | 
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Switch to the Jet pointer. 
left = left + 1 = 3 (right > lett) 
Compare temp with Alleft] = A[3] = 2. No interchange. left = left + 1 = 4 (right < 
> left] = 
Terminate. 


At this point, A is 





All the key values to the right of 10 are greater than 10. All key values to the 
left of 10 are less than 10. Two new pairs of values for left and right now exist, 
namely (1,3) and (5,8). The pair representing the larger subfile is placed on a stack; 
the shorter file is processed immediately using quicksort. At any point in the - 
process, the stack contains pairs of left and right pointers of all subfiles still to be 
sorted. Each recursive call of quicksort for a subfile pops one pair from the stack. 


< haalii: 

A detailed analysis is quite difficult. We consider a special case in which the 
original file contains n = 2* key values and each splitting element divides its 
subfile into two equal parts. (This ‘represents the ideal situation in quicksort.) The 
first pass, then, involves approximately n comparisons (really n — 1) and splits the 
file into 2 subfiles of approximately n/2 elements each. The number of compari- 
sons for each subfile is approximately n/2 for a total of 2(n/2) comparisons: Each of 
those subfiles is split into two subfiles, making 4 subfiles each of (n/4) elements. 
In general, at the k stage, we can consider 2* subfiles each of n/{2*) elements. The 
number of comparisons required is approximately 2*(n/2*) = n. (Really, fewer com- 
parisons are needed.) The total number of comparisons is n times the number of 
times the file is subdivided. But that number is the value of k for which 2* = n, or 
logon. Quicksort, therefore, is O(nlog:n) in the best case situation. It is also 
O[nlog,n) in the average case for any file size, but O(n?) in the worst case, namely 
when the original file is already sorted. Memory requirements for quicksort in- 
clude the stack whose size depends on the number of recursive calls and an addi- 
tional location for Temp. Patt gee 


4. Merge Sort 


The objective of a merge sort is to merge [i.e., combine two or more sorted 
files into a sorted file) sorted subfiles into increasingly larger sorted files until one 
sorted file results. Before we show an example of a merge sort, we illustrate the 
concept of merging two sorted files into a third sorted file. (Merging is one of the 
basic processes in data processing where the sorted files are frequently referred to 








(a = 1) + (n - 2) + {n- 3) + ... #34241 = nln) 


This is an O(n’) method.’ In addition, there are n — 1 interchanges and only a 


few extra storage locations for interchanges. No matter how the file names are 


arranged initially, selection sort requires the same number, O(n’), of comparisons. 
It should be used only on small files. 

S . > > : ; y 
È i 


2. Insertion Sort 


. The objective of an insertion sort is to place successsive key values into an 
existing sorted file. The first key value is assumed to be a sorted file of one 
element. The second key value is then inserted in that file to form a sorted file of 
two elements. In general, the k key value is inserted into a sorted file of k — 1 

elements to form a sorted file of k elements. An example of a simple insertion sort 

will be given; shell sort (sometimes called diminishing increment sort) is another 

insertion sort which will not be included. 

Algorithm: Sorts key values into nondescending order in an array A. A[l] is a 

sorted file of one element. Elements Afi] through Afk] are in order after each pass. 

The indices of A are k, i. Temp is an integer which holds the element to be 
inserted. found is boolean. 


begin ; : Ser Se i 
for k := 2 ton ; . 
do begin (*insert A[k] into the sorted file*) 

Temp := Afk]; % 

i:=k-1 ; 

Found := false; 2 

while (i > = 1) and (not Found) {*move all key values > temp*} 

do if Temp < Ali] ` ; i 
then begin 






; Ali + 1] m Ali]; 
i:=i-1 
end; 
else Found := true; | $ 
Afi + 1] := Temp(*insert temp in proper position") 
end ; 
end; 
EXAMPLE: at 
rune fi alse Je 





Initially A[1] = 10. 


jo | 26 [2 |7 |o Jas [25 | so | xey va 


| ‘For a discussion of the sum of an arithmetic series and O(n’), see Appendix C. 
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At each pass, every merge is performed according to the preceding example. 


Analysis: ; 

A rough analysis of merge sort is similar to quicksort. Each pass requires 
approximately 2*(n/2*) = n comparisons and there are k = logn passes maximum. 
Merge sort, therefore, is O(nlog.n|. There are variations of merge sort which are 
slightly more efficient but still O(nlog.n). ` 


Sig 


5. Sorting/Searching Using an Ordered 
7 Binary Tree | 


An ordered tree provides a convenient structure for a method of sorting key 
values which also facilitate searching for key values. One key value is designated 
the root of an ordered tree. Each successive value is compared to the root, then to 
roots of left or right subtrees depending on whether the key value is less than or 
greater than the value at the root. After all key values are inserted, a sorted file can 


be obtained by traversing the tree in inorder. To search for an element, utilize . 


similar steps as in constructing the tree. Either the element will match a value at a 
root or an empty subtree will be reached (indicating the element is not in the tree). 
Algorithms: Be Gas 
1) Using a tree to sort key values, we assume the existence of four 

procedures: .. Ay ase ne ee aren eae 
(a) Createtree (KeyValue). Creates a new binary tree. consisting of a 
single node. KeyValue is the value in the information field of the 

node. Returns a pointer to the node. 


(b) LeftOffspring (Ptr, KeyValue). Accepts a pointer Ptr to a binary tree 


node with no left offspring. Creates a new node as left offspring and 
inserts KeyValue into its information field. Error condition if Ptr 
points to a node with a left offspring. 


: (c} RightOffspring (Ptr, KeyValue). Similar to LeftOffspring but. 


creates a right offspring. 


(d) InorderTraversal (Tree). Traverses Tree in inorder and prints con- 
tents of each node (i.e., traverse left subtree in inorder, print con 
tents of root, traverse right subtree in inorder). ; 

In addition, assume a file with key values k,,k,,...,kn; iis the index of key 
values. Temp is an integer used to hold a key value and p, q, t are pointers to tree 


nodes. Nodes are 


i 





Ġa m a eo |w m w e a m a sw eee ee M 
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- 7a pore ; ee 
Èi drl] slalo 
8" pass: 


el? [s [els ao 


Analysis: to 
Only one comparison is made for each pass if the original key values are 


already sorted. This best case situation is O(n). When the initial values are sorted 
in reverse order, then the total number of comparisons is: 


In-1+(n-2)+...+34+241= 20 
This worst case situation is O(n’). It can also be shown that the average case is 
also O(n’). Only one extra memory location (for Temp) is required. There are ways 
to improve the efficiency of insertion sorts (e.g., shell sort). Note that a linked list 
could be used instead of an array, but the number of comparisons will still be 
O(n’). ees 


3. Exchange Sort 


The objective of exchange sorts is to interchange successive pairs of key val- 
ues as necessary until one value attains its proper place in the file. Common 
examples of exchange sorts are bubble sort and quicksort. 


a) Bubble Sort 


` Bubble sort is easy to understand and program but is less efficient than most 
other sort methods. Each pass involves comparing successive pairs of key values 
and interchanging only those which are not in proper order. After k passes, the k 
largest (smallest) values are in place. The process terminates at the end of the first 
pass which does not have any interchanges. 

Algorithm: Sorts key values into nondescending order in an array A. Key 
values A[n—(k — 1)],..,A[n] are in order after the k® pass. The index of A is i, 
passnumber is an integer (I,... ,n) which keeps track of the number of passes. 
Temp is an integer used in interchanges. The variable Interchange is boolean. 


` begin 

Interchange := true; 

Passnumber := 1; 

while (Passnumber < = |n — 1j) and (Interchange) 


do begin ("this loop controls the number of passes*} 
Interchange := false; © 


-fori := 1 to (n — Passnumber} = 
-` ` do if Afi] < Afi + 1] (*key values are out of order’) | 


esa | =a z3 = = = 3 > . ` 
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tas eS 5 | z p= 8: 


AO ee 
ae ae a a > e Cm 
7 Hert ABs 75 eS >a RE N -,25 =- 
w Rec ‘ 40 
; N 2 9 : 
= ; : 
ii) Traverse tree in inorder and print results. Nodes in order of print- 
ing: 2, 7, 9, 10, 25, 28, 40, 45. . RSF dia 
iii) Search the tree 
a)k, = 7 eee 
Found = false. p, t point to root 10. 
7:4 240. 
7<10 os 2 
: Follow left pointer of 10, p points to root 2. ` > 
: ede he OS 5 3 
tai È 
Follow right pointer of 2. p points to root 7. 
2 7=7 ; 


=a 
4 
| 
| 
| 

[l 

f 

i 
; 

îi 
a 
q 

i 

B 

; 

p 

i 

B 
a 


Found = true | 
Search is successful. 


b) k, = 27 aga Sis Bi fel: 
Found = false. p,t point to root 10. ` gata noaa 
27<>10 ot i Les iad 
ew 2 = lee sd 
Follow right pointer of 10. p points to root 28. 

27 < > 28 aA 

27 < 28 
Follow left pointer of 28. p points to root 25. > = ) 
27 < > 25. ak ab a pads 
273-> 25 aes, Se 

Follow right pointer of 25. p = nih- = sco 7 

Exit loop. 171? Pie L PO Cpl 

k, = 27 is not in tree. OT 


2 
Analysis: . A 1 oe 7 7 

In the worst case, the file is already sorted (or is sorted in reverse order). Then 
the ordered binary tree is merely a linked list. In that case, the binary tree sort is 
O(n’) and binary tree search is O(n). In the best case, each root has the same 
number of leftsubtrees as rightsubtrees. Then half the original values are less than, 
and half are greater than, the root of the tree. Each time a comparison is made, half 
the remaining elements are deleted from consideration. The resulting number of 
comparisons is similar to that in a binary search or in quicksort. Therefore, binary 
tree sort is O(nlog,n) and binary tree search is Oflog:n). For.a file of randomly 
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significantly to the time of the algorithm. Most commonly, bubble sort is said to 
be O(n’). Because of the interchanges involved, it is less efficient than selection or 
insertion sorts in the same O(n’) category. Only one additional memory location is 
required for Temp. Bubble sort could be considered if it is known that the original 
file is already sorted (or nearly so) because only one pass would then be required 
` and the method becomes O(n). Various methods exist to improve bubble sort, but 
not significam enough to recommend. 


` b) Quicksort 


Each pass of quicksort (or partition exchange sort) splits (or partitions) the 
existing file into two subfiles such that all elements in one subfile are less than a 
specific element (the splitting key} and all elements in the second subfile are 
greater than the element. Initially pointers (indices when an array is used} are set 
to the leftmost and rightmost key values in the original file. A key value (splitting 
element} is chosen and is compared to all other key values. Interchanges occur. 
until the two subfiles are obtained and the value is in its proper position relative to 
the entire file. The process is repeated on each subfile, until the entire file is. 
sorted. 

: We will give an example and brief analysis of quicksort. The algòrithm can be 
written as a recursive procedure. 


i i EXAMPLE A: 





Let left = index to leftmost key value 
Let right = index to rightmost key value 
Initially left = 1, right = 8. 

Choose 10 = Ali] as the splitting element. 
Set temp = 10 

Compare temp with Alright] = Als} = 40. No iit (40 > 10). 

right = right — 1 = 7 (right > left). 

Compare temp with Alright] = A[7] = 25. No interchange (25 > 10). 

‘right = right — 5 = (right > left) 

Compare temp with Afright] = A[6] = 45. No interchange (45 > 10). 

right = right. — 1 = 5 (right > left). 

Compare temp with Al[right] = A[5] = 9. Exchange 9 and 10. (An actual 

exchange does not take place. 9 is inserted in Afl] and the index right = 5 is 
maintained as the possible location for temp. Think of 10 = A[5}). 


Because an “exchange” occurred, switch to the’ left pointer. left = left + 1 = 2 

(right > left) l 

Compare temp with Ajleft] = A[2] = 28, Exchange 28 and 10. (That is, insert 28 in 
Alright] = A[5] and maintain the index left = 2. Think of 10 = ADI ) 

Switch to the right pointer. . ; 

right = right — 1 = 4 (right > left] 

Compare temp with Afright] = A[4] = 7, Exchange 7 iòd 10. (insert 7 in Allee = 

_, A[2] and maintain right = 4. Think of 10 = A/4].) 


| 
| 
= 
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10. Sorting 


Computing systems must be able to search quickly through large amounts of information. For 
example, a banking machine must not make customers wait long periods as it establishes the customer’s 
validity or performs the requested transactions. Similarly, an automatic telephone directory cannot make 
acaller wait long periods as it searches one record after another for the requested phone number. 


As we saw in the previous chapter, there are radically different speeds with which algorithms for 
searching converge, cither locating the desired item or discovering that the item is not present. This 
difference in speeds was, as we showed, an order of magnitude. ‘The fast algorithms were fast preciscly 
because they searched lists that were ordered. Such lists are ordered because the items are sequenced 
logically. "Fhe process of sequencing items logically, called sorting, facilitates locating desired items. 


Sorting is one of the most studied and best understood domains of computer science. There are dozens 
of general-purpose sorting techniques and many more special-purpose sorting algorithms, In this 
chapter, we will present seven of the more common general-purpose sorting algorithms and briefly 
analyze their performance in a variety of situations. To do so, we will include pscudo code and analysis 
subsections for cach algorithm. 


10.1 What It Means to be Ordered* ** 


Ordering is an apparently obvious phenoménon in our everyday lives. Naturally, we think of 
nuncrical ordering for integers, each clement larger than the last, or alphabetical ordering for items in the. 
author index of the library’s card catalog system. Yet ordering is full of subtictics. We necd a clear and 
precise understanding of what ordering is. 


10.1.1 What Is Being Ordered 


In the usual mathematical sense, sets of elements are ordered.’ ‘Too often in computer science, people 
get the impression that what is being sorted or ordered is some particular data structure, such as arrays 
and linked lists. Arrays, linked lists, and other data structures such as trees and even Pascal 
implementations of sets are realizations of what is being sorted. They are ways that current technology 
implements and represents orderings. ‘The thing being ordered is a set. To explain ordering, we will use 
formal mathematics notation. 


For the following discussion we will use S to denote a set of clements that is being ordered. S might be 
a collection of integers, a collection of character fields in a Pascal record, or some other ensemble. It isn’t 
important what the particular set happens to be. What is important is that the propertics discussed below 
are truc for each of these sets. : 


40.1.2 The Binary Relation 


Ordering is based on the notion that the elements of a set are related to one another. Informally, a 
relation is binary if it compares two elements at a time. For example, the following relations are intuitive 
and binary: John is taller than Joe. Phil is older than Lee. In these sentences, the binary relations are 


/ 


File Date: January Ist 1985 at 17:22 z Scribe Date: January 2nd 1985 at 00:36 
File Name: Cl IAPTERI3.MSS.115 line 3 Scribe Version: 4(1400) 
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as a master file of records and an update file of transactions. The two files are 
-merged into a new master file which in turn can be updated whenever additional 
transactions are collected into an update file.) 


EXAMPLE 
- Assume two files of key values: 
File 1: 2,7,9,10,25,40,45 
File 2: 5,8,10,18,20,30 


Elements to be compared 


From File 1 | From File 2 Result (smaller element written to File 3) 
2 3 a eee 
7 ; 5 
Be E 8 7 
9 8 8 : 
ets E 10-2: ».-). +i #. £2 ¥Ofeither one) 
ii 5 ee ae if: R EA S 
25 Ties $: RAE 18 a 
25 20 20 z 
25 30 - 25 
. 40 30. = PERE 
40 ttre í 40 
45 45 


(When one file becomes empty the remainder of the other file is written directly to 
File 3.) F hae 
` The example’ of a merge sort that we use is called a straight merge sort. 
Initially, divide the file into n subfiles of length 1. Merge adjacent pairs of subfiles 
into n/2 sorted files of length 2. Merge adjacent pairs of these subfiles into n/4 
sorted files of length 4. In general, merge adjacent pairs of subfiles into n/2* sorted 
files of length k. 


- EXAMPLE: Assume key values are 10, 28, 2, 7, 9, 45, 25, and 40. 


Original file _ Sio c pai iy 9 45 25° 40 
n = 8 subfiles of length 1. 





Ist pass: (10 28) aie (9 45) (25 40) 
n/2 = 4 subfiles of length 4. } 


2nd pass: ee iy: (2 7 - 10) a (9 25 40 45) 
n/4 = 2 subfiles of length 4. 







3rd pass: 


n/8 = 1 subfile oflength5. . _ (2 7.9 10 25 28 40 45) 
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the brother of. Clearly no one is the brother of himself, implying that NOT aRa holds for this familiar set 


and binary relation! 


Total 


‘This property means that all the ‘tems of a set are comparable. For example, all of the items may be 


integers, real numbers, or any other class of items that can be compared logically. People sometimes say 
that a comparison is like, "comparing, apples and oranges.” ‘The total property means that given a set S 
and a relation R, the set has no “apples and oranges” with respect to R. More formally, the property 
states that for every a,b in S: 


EITHER aRb OR bRa 


Note that this does not preclude both aRb AND bRa which would imply thata = b. ‘The important 
puint for you to understand is that sets of incomparable elements cannot be ordered logically. 


10.1.3 The Total Order 


Given a set S and a binary relation R which satisfies antisymmetric, transitive, reflexive, and total 
properties, we say that the set is totally ordered, The problem of sorting may be defined as follows: 
Given a subset selected from a set with a total ordering, arrange the clements in such a way that they are 
ordered. We have briefly seen sorting before. For example, we suggested in an exercise that you take an 


array of numbers and arrange them in increasing order. 


10.1.4 Internal Sorting 


Many early computers had only 4.000 cells of primary memory. Even today, computers with more than 
a few million primary memory cells are rare (although this is changing rapidly). Data bases upon which 
computers act are sometimes cnormous, with tens and even hundreds of millions of elements. If the 
memory of the computer is smaller than the number of elements that a program is to sort, then a portion 
of the set to be sorted must reside in secondary memory at all times. This kind of sorting compriscs an 
entire class of algorithms known as the external sorting algorithms. We are primarily concerned here 
with internal sorts, those in which the entire set of clements to be ordered is stored’ in primary memory 
while the sort is executed. Virtually all sorting routines discussed here are internal sorting routines. We 
will deal almost exclusively with internal sorting and not discuss sorting with tape drives or disk sorting. 
We do, however, present one sorting algorithm which is also suitable for external sorting. 


10.2 Insertion Sort 


Insertion sorts attempt to generate a sorted list in the following way: start with an empty list, insert one 
element into the list at a time, maintain a sorted list throughout the process. Thus, when all elements 
have been inserted, we will have a sorted list of all the elements. 


10.2.1 Pseudo Code 


If the unsorted elements are in unsortedf 1..NumberOfElements] and the sorted elements are to be put in 
sorted{L.. NumberOfllements], the following pscudo code demonstrates the insertion sort algorithm: 


Vile Date: January Ist 1985 at 17:22 Seribe Date: January 2nd 1985 at 00:36 
ile Name; CHAPTER 13. MSS.115 line 151 Scribe Version: 4(1400) 


i 






S 


dn e æ e ww p eS a on ca ae pees a = «a 


a =: en a m n = 
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begin 
tis Createtree(k,); (*first key value is the root*) 
for i = 2 to n (*repeat for each successive key value”) 
do begin ; 

Temp := ki; 

p:=t; 

while p < > nil (*travel down tree until a node is reached wich has no 

offspring*) 
do esi 


= P; 
i Temp gZ Infolp) - 
then p := Left(p) 
else p := Right(p) 
end; 
if Temp < Infolq) 
then LeftOffspring|q,Temp) - 
else RightOffspring lą, Temp) 
end; 
InorderTraversal\t) 


end; 


2) Searching an ordered binary tree. Assume a tree created as above and search is 


for key value k,. Let Found be boolean. z 
Found := false; ; S ; 
p:=t i 
while (p < > nil) and (not Found) : 
do if k, = Info(p) 
then Found := true 
else if k, < Info(p) 
then p := Left(p) 
else p := Right(p); 
if Found 
then search is successful 
else k, is not in tree. 


EXAMPLE: Assume key values are 10, 28, 2, 7, 9, 45, 25, and 40. 
i) Create an ordered binary tree. 
10 is a root. 
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PROCEDURE {nsertionSort (list). 


BEGIN 


FOR index + 2 to NumberOfElements 
BEGIN 
inner + index E = 
WHILE (inner > 1) CAND (listLinner] < list{inner-1]) 
BEGIN 
swap(listLinner], listLinner-1]) 
inner «- inner - 1 
END 
END 
END 


Again, the algorithin is illustrated below by the diagram. 


Vist list list list list 
E 
29 [13] | 13] 
7 | 29| 29| 
57 |57] 
61] [851 

index = 4 index = 5 





10.2.2 Analysis 


How many comparisons of cleme 
case, every clement to be inserted in the list must always be compared with every 0 
the list. For the worst case, the number of comparisons is the following sum: 


142434... + (ml) =a(n-)/2 = n?/2 - n/2 = O(n?) 


In the average case, we only compare each clement to be inserted with h 
cases, the number of comparisons is 1/2 * n(n-1)/2 = n(n-1)/4 = O(n?). 
already sorted. Hence we only need to compare cach element to one other element 
comparisons. We sce that for large n, a list which is already sorted or nearly sorted requires su 


fewer comparisons than if the list is in reverse sorted order. 


10.3 Bubble Sort 


The idea behind bubble sort is quite simple. The first 
they are out of order they are exchanged. Next the secon 
they are out of order, they are exchanged. Next the third 
sixth, etc. If we are sorting into increasing order, then the largest clement 
position of the list. After two passes the two largest elements have bubbled up in the last two positions. 
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ats in the list must be performed with this algorithm? In the worst 
ther elenient already in 


alf o£ the sorted list In such 
In the best case, the list is 
which isn- 1 = O(n) 
bstantially 


element is compared to the second element; if 
d clement is compared to the third element, if 
and fourth, then fourth and fifth, then fifth and 
"bubbles up” in the last 
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Sorting Algorithms 49 
distributed key values, the actual number of comparisons will lie somewhere 
between the best and worst cases, but the binary tree sort and search methods are 
generally considered to be Of(nlog:n}) and O(log,n), respectively. 

The following table indicates the advantage of using methods of O(nlog)n) 
rather than O(n?) for large files. For a small file the additional complexity of some 
O{nlog.n) methods make them less attractive and, in some instances, undesirable. 


















n nlog.n 

A 4 2 

E: 16 8 

8. 64 24 

16 256 64 

1024 1,048,576 10,240 
” 268,435,456 229,376 s 
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PROCEDURE SelectionSort (unsorted, sorted) 
BEGIN s 
FOR outer — 1 TO NumberOfElements 
BEGIN ae 
CurrentMhin unsorted[ 1] $ 
CurrentMinPosition t- 1 
FOR inner + 1 TO NumberOfElements 
IF unsorted[Linner] < CurrentMin 
THEN BEGIN 
CurrentMin unsorted[Linner] 
CurrentMinPosition — inner 
END 
sortedfouter] = CurrentMin 
unsorted[CurrentMinPos ition] + +00 
END. 
END 
unsorted Sorted unsorted Sorted unsorted Sorted unsorted Sorted unsorted Sorted sorted Sorted 
[13] fe}. 12 iC ae 2 oo ee Gi 
|29] Pe} 3 - La] ee p 
=| 29) = |29 |29] | 
57 = bd eC Bee | a 
[s] : Ze. ze Eà [61 e] [et 
outer = 1 outer = 2 outer a 3 outer = 4 outer = $ 
Note that we can modify the code so that it is performed in one list rather than two and need not use 


the infinity flag. 
PROCEDURE SelectionSort (list) 


BEGIN : 
FOR outer + 1 TO NumberOfElements 


BEGIN 
CurrentMin «+ list[outer] 
CurrentMinPosition = outer 


FOR inner + outer+1 TO NumberOfElements 
IF listLinner] < currentMin 


THEN BEGIN 
CurrentMin + listLinner] 
CurrentMinPosition + inner 


END 


swap(listLouter], list{CurrentMinPosition]) 


END 
END 


The modified selection sort is much more popular than straight sclection sort. 
* 
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taller and older respectively. For the expression 5 > 3, the binary relation is >. 


It is important to remember that a binary relation, R. is a property of a set, S. There are four propertics 
of binary relations that, when present, imply that set S is ordered with respect to binary relation R: 
antisymmetric, transitive, reflexive, and total. 


Antisymmetric 
‘This property allows for the equality of clements of a sct, but docs not demand it. Consider elements 
abofset S. 


aRb& bRa=>a=b 


=æ æm = o op an 


Suppose that S is the set of integer expressions, that Ris <=, and with a = 3andb = l + 2. It is clear 
-that aRb & bRa. Similarly suppose a = 3 and b = 2 + 4. Now it is clear that aRb & NOT bRa. This 


leads to the obvious consequence that: 
uRb& NOT bRa=>aOb 


Transitive 
Transitivity is an interesting property that is encapsulated casily in mathematical notation. 
` k 


aRb& bRc => aRc $ 


If Phil is older than L.ce and Lee is older than Rob, then Phil is older than Rob. If New York is larger 
than Chicago and Chicago is larger than Nashville, we know, even without comparing them directly, that 
New York is larger than Nashville. : 


Transitivity may seem so obvious that you might think it trivial. There are, however, numerous 
examples of things that we try to order where transitivity doesn't hold. One recurring example is in 
sports. We often find the case where team A can beat team B and team B trounces team C. But when 
team A and team C meet, team C wins! Another example is politics. Here we might find that Mondale is 
preferred to Reagan and Reagan is preferred to Glenn, but that Glenn is preferred to Mondale. The 


point of this is that binary relations over sets need not be transitive. But, if the set is to be ordered then 
transitivity must hold. £ 


Reflexive 


Another surprising property of a binary relation over a set is reflexivity. For a set S to be ordered with 
respect to R it must be the case that? 


aRa for every ain S 
Suppose that S is thc“integers and R is <=. ‘Then this property of reflexivity implies that 1 <= 1,2 <= 


2 etc... for all integers. You might be thinking, “Is there any binary relation over some well known set 
that can possibly fail to exhibit reflexivity?" Consider § the sct of male children ina family and R to be is 
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Thus, the idea behind merge sort is to break the original list up into lists of tenjth 1, then start merging 
the sorted lists back together until there is only one list left. ‘The following diagram illustrates how a list 
of 16 clements is broken up into 16 onc- -element lists, merged into 8 two-element lists, then 4 four- 


element lists, 2 cight-cleiment lists, and finally one sorted sixteen- element list 
DBOOOBOHOOOOOOO 
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The following illustration shows an cxample with values added: 
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Guu 13 59 78 83 91 93) G 8 21 31 67 81 88 37) 
Caru 13 21 31 59 67 78 81 83 a8 91 93 97) 


The algorithm for merge sorting is recursive: 
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PROCEDURE InsertionSort (unsorted, sorted) 


BEGIN 
sorted[1] + unsorted[1] 


FOR index + 2 to NumberOfFElements 


BEGIN 
inner + index 
WHILE (inner > 1) CAND (unsorted[ index] < sorted[ inner-1}) 


inner + inner - 1 


insert(unsorted[ index], sorted, inner) 
END © 
END 


Note that the condition for the WHILE loop uses the operator CAND. ‘The CAND operator is known 
as a conditional AND operator. It has the property that the Icft operand is always evaluated before the 
right operand and the right- operand is only evaluated if the Ieft operand evaluates to truc. If the Ieft 
operand evaluates to false, the value false is returned without evaluating the right operand. The 
conditional AND operator is commonly used to avoid division by zero or, out-of-bounds array indexing. 


Here we insure inner-/ is always a valid array index. 


Briefly the insertion sort algorithm works as follows: the first element of unsorted list is used to create 
the initial sorted list. Then each of the remaining elements of unsorted list are inserted into the 
ne sorted list one at a time always maintaining the list in sorted order. The 


appropriate position of tl 
at the end of the sorted list and 


- correct position for each clement being inserted is determined by starting 
searching towards the front of the list. > e 


For this code, insert takes a valuc to insert, a list in which to insert the value, and a position in the list to 
insert the value. Note that all values starting at the current position must be appropriately shifted within 
the list. 


The illustration below indicates how insertion sort works. 


unsorted sorted sorted sorted sorted sorted 





index = 2 index = 3 index = 4 index = § 


Note, however, that we can do insertion sort in one list rather than two, as the following modified code 


illustrates. 


- ta 
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10.6.1 Pseudo Code : 


PROCEDURE Quicksort(s) 


BEGIN - ee 
IF s contains 0 or 1 elements ig = 
THEN RETURH(S) 


ELSE BEGIN z 
choose an element e from s 


partition s into sequences S,, So, and s ; . 
so that elements of each are less than, equal 
to, and greater than e respectively : 


RETURN (Quicksort(s,). S24 Quicksort(s,)) 
END : 
END 


10.6.2 Analysis 


All the comparisons in this algorithm are done du ring the process of partitioning. Each clement of the 


sequence must be compared with the partition clement. Hence, if n elements are in the sequence, 


partitioning requires n-1 comparisons. 


Note that in using Quicksort, cach sorting problem is broken into two smaller problems. Tn the average ` 


case, approximately half of the elements go into cach subproblem. When this is done, a tree of sorting 
problems is formed. Each leaf node corresponds to an clement to be sorted. Recall that a tree of n leaves 
has a height of log(n). ‘Therefore, the average case and the best case need O(n*log(n)) comparisons. 


In the worst case, the element chosen to partition around is always the smallest or the largest. This gives 
the tree a height of n rather than log(n). In the worst case, Quicksort requires O(n?) comparisons. A good 
deal of study has gone into the performance of quicksort. [6] A good rule of thumb for quicksort is that 
by selecting three elements (instead of onc) and using their median in the algorithm, the likelihood of 


pathological behavior is very, very low. 


10.7 *** Heap Sort 


Suppose we have a group of marbles, cach with an integer value associated with it, that we want to sort 
in decreasing order. All six of the algorithms we've discussed so far would work just fine. However, 
assume we have a special bag to assist in our sorting task. ‘The marbles may be deposited into the bag, in 
any order. The bag is special in the following way: whenever we reach into the bag to remove a marble, 
we are guaranteed to grasp the marble with the largest value. This is indeed a very special bag, for it 
would simplify greatly the task of sorting the marbles. We would put all the marbles into the bag, then 
remove them one by one in decreasing order. The special bag idea excmplifics a data structure known as 
a heap. The use of the heap data structure for sorting is referred to as heap sort. 


A heap is a binary tree with the following three propertics: 


e The value of each node is greater than or equal to the values of both children. 
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10.3.1 Pseudo Code 


PROCEDURE BubbleSort (list) 


BEGIN 
FOR outer ‘— NumberOfElements-1 DOWNTO 1 
FOR inner + 1 TO outer 
IF list[Linner] < list[outer] ae 
THEN Exchange(list[Louter], list[Linner]) 
END 


10.3.2 Analysis 


‘Again let's analyze the number of comparisons necessary for this algorithm. We sce that cach iteration 
of the inner loop has another comparison, although it doesn’t necessarily have an exchange. How much 
data is already ordered has no effect on the number of comparisons! It does ‚however, affect the number 
of exchanges. ‘The number of comparisons is the samc in the worst case, average case, and best case, It 
depends only on the number of elements to be sorted. 


If the number of elements to be sorted is n, the number of comparisons needed is the following sum: 
L+24+3+4.. + (n-l) = n(n-1)/2 = O(n’). 


10.4 Selection Sort 


“Lhe idea behind selection sort is that we select the smallest clement from the list by making a pass over 
the data. We place this smallest clement in the first position, On a second pass, we select the second — 
smallest clement, placing it in the second position. ‘This process continues until all elements have been 
selected and placed appropriately. ames 


10.4.1 Pseudo Code 


If we start with the unsorted list in unsorted[1..NumberOfElements] and want the sorted list in 
sorted{L.. NumberOfllements], then the following pseudo code illustrates the algorithm: 


: ` 
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Recall that when a value is removed from the heap, it is always the largest value. But since the root has 
the largest value, the root is the node to be removed. We are not sure which node will be the new root, 
but we are sure that when the deletion process is completed, there will be one less node at level d than 
before. If there is a node with one child (there will be only one), we exchange its children value with any 


leaf at level d. ‘The value not at the root is percolated down the heap. so that ail heap propertics are ` 


insured. < 
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outer = 2 outer = 3 outer * A outer » § 





10.4.2 Analysis 


Let's analyze the number of comparisons of both versions of selection sort. For the original version of 
selection sort, the worst case, average case and best case are always exactly the same. ‘There are n-l 
comparisons for each iteration of the outer loop, which iterates n times’for a total of n(n-1) comparisons. 


‘This is a total of n-n= O(n’) iterations. 


For the modificd selection sort, the average case, worst case and best case still all use the same number 
of comparisons. However, each requires a different number of swaps. The number of comparisons 
necded isl +2+4+3+..+ (n-1) = n(n-1)/2.' Although this is 1/2 of the number required by the first 


version, it is still O(n). 


10.5 Merge Sort 


ore lists. For example, given two lists which are 


The idea behind merge sort is to merge two or m 
d list from the two. Consider the task of merging 


already sorted, it is relatively casy to form a new sorte 
the following two ordered lists into a combined ordered list. 


list 1 = (21, 43, 47, 51} 


list 2 = {16, 18, 22, 27, 33, 59} 


We need only check to the first element of cach ordered list to construct the single ordered list because 
we know that the first element is the smallest in each list. ‘Therefore, the smallest clement from the 
combined list must be either the first clement from list one or the first clement from list two. In this case, 
it is the first element of list two. Now that we have the smallest clement in our resultant list, what is the 
second smallest element? In this case, it must come from either the first clement of list one or the second 
clement of list two. Onee the smaller of these two is found, the process continues. 


resultant ordered list = {16, 18, 21, 22. -27, 33, 43, 47. 59} 


We see that it is’casy t merge two sorted lists into one sorted list. However, we normally start with one 
unsorted list rather than two sorted lists. The problem is to get the two sorted lists from one unsorted one. 
Actually, it is really casy. Instead of thinking of one unsorted list of n clements, think of it as n lists of 1 
clement each. Lists of length 1 are by definition already sorted. Merge adjacent sorted lists. ‘There are 
now n/2 ordered lists, each having two elements. Continue merging adjacent pairs of ordered fists until 


there is only one list left. It will be ordered! 
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10.7.1 Pseudo Code 


c understood, the procedure for adding a value to a heap 


Once the SiftUp and SifiDown procedures ar 
and deleting the heap’s root becomes easy to understand. ‘The following procedure illustrates the use ofa 


heap in sorting.- We assume input is in unsorted[1..NumberOfllements} and result is to go in 
sortcd{1..NumberOfElements}. 


PROCEDURE HeapSort 


BEGIN ; 
FOR index + 1 TO NumberOfElements 


AddTo Heap(unsorted[ index], heap) 
FOR index + 1 TO NumberOfElements 
sorted[ index] -+ DeleteRoot(heap) 
END 


10.7.2 Analysis 


10.7.2.1 addtoheap 


In the worst case, every clement must be sifted up to the root which requires O(log(n)) comparisons for 
a heap with n nodes. In the best case, no new nodes need to be sifted up the heap, "thereby requiring only 
one comparison. ‘To build a heap of n nodes, there are O(n*log(n)) comparisons in the worst case and 


O(n) in the best case. 
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PROCEDURE Sort(list) 


BEGIN 
IF length(list) = 1 
THEN RETURN(Tist) 
ELSE BEGIN =: 
middle « length div 2 
RETURN(Merge(Sort(list[1..middle], Sort[middlet1.. length] 
END 
END 


PROCEDURE Merge(firstlist, secondlist) 


BEGIN EE 
WHILE NotEmpty(firstlist) AND NotEmpty(secondlist) DO 


BEGIN 
compare the smallest element of each list 


remove the smaller of the two from the sorted list 
and add it to result list 
END 


add any remaining segment of the original list to the new list 
END 


10.5.1 Analysis 


In the worst case. to merge two lists with a total of n elements, we need n-1 comparisons. The above 
diagram shows the tree structure of the algorithm. Recall that a tree with n leaves has a height of log(n). 
Hence, there are log(n) le vels, each of which requires O(n) comparisons. Therefore, there is O(n*log(n)) 
comparisons required for worst case, average case, and best case. 


Merge sort also works for solutions to the external sorting problem, because only two clements have to 
be in primary memory at any stage for merge sort to work; all other values can reside on tape or disk. 
Once the smallest element is selected from the pair, it is written out to secondary memory and another 
value is read in. In the early stages of developinent of ciectronic computers, when internal memory ‘was 
very limited and expensive, all automatic sorting was done this way. In fact, merge sort was one of the 
first stored computer programs. 


10.6 *** Quick Sort 


Quicksort is another sorting algorithm that is best stated recursively. Unfortunately, it is somewhat 
easier to understand than it is to implement. ‘To sort a sequence S, we first pick an clement c from the 
sequence. We then partition the sequence S into the sequences S}. Sy. and S, such that all elements in $S} 
are less than c, all elements in S are greater than c, and all elements in S, equal e. Then we recursively 
sort S} and S,. $, is by definition already sorted. ; 
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10.9 *** Radix Sort 


Radix sort is also referred to as Bin sort and Bucket sort, Say we had to sort a group of numbers in the 
range | through 1000, If we had a collection of bins to distribute the numbers say 10 of them, 1-100, 
101-200, 201-300, ctc., we could first distibuic the numbers in their appropirate bins, sort each of the bins 
separately and finally concatenate each of the 10 sorted lists into one sorted list. ‘This is the essence of bin 
sort. Kirst we distribute the elements into the bins. We then sort cach bin separately.” Finally, we 
concatenate the sorted bins to form the sorted list of elements. ‘The following code illustrates the 


algorithm. 


10.9.1 Pseudo Code 


PROCEDURE BinSort 


BEGIN 
FOR index + 1 TO NumberOfElements 


AddToBin(Element[L index], Bin( fn(element[index]))) 


FOR index + 1 TO NumberOfBins 
SortBin(Bin[ index]) 


concatenate bins z : 
END 


The bin into which an clement is placed depends on the value of the element. It is a function of it’s _ 


value and is signified in the statement of the algorithm by the function call to fn. 


10.9.2 Analysis 


In the best case, no more than one clement is in each bin. Therefore, no sorting within bins is needed. 
We need only distribute the elements into the bins and concatenate the bins. If there are n elements and 
b bins, we can do them in O(n) and O(b) time respectively. In the average case, there are less than some 
constant number of elements in each bin. Sorting within bins can still be donc in constant time. Hence 
the algorithm takes O(n) time in the average case. In the worst case, they all end up in the same bin. The 
time then depends on how cach bin is sorted. Typically, it will be a O(n’) algorithm since the bins are 
expected to have few clements(small n) relative to the number of elements to be sorted. But in the last 
section, we said n*log(n) is a lower bound for comparison based sorting. How then can this algorithm 
only require O(n) comparisons to sort? In the strictly comparison based sorting algorithms, we knew 
nothing else about the elements. However, to have an expected time of O(n) with bin sort, we must know 
something else about the elements we wish to sort. ‘Technically, we say the clements to be sorted must 
satisfy a Lipsitz condition. This means that the data should be randomly distributed and the number of 
bins should be of the same order as the number of elements being sorted. 


10.10 Summary of Sorting Algorithms 


= 
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o All leaves in the tree are at a depth of d or d'l. 


e ‘There is at most onc node in the tree that has other than zere or two children. 


‘The primary property of a heap is that the valuc-of each node is greater than or equal to its child’s. This 
property insures that the node with the largest value is at the root of the tree (top of the heap). ‘The other 
two properties tend to keep the tree “balanced” and "cull". ‘They are not strictly necessary for the 


underlying heap structure.. 
= a 


The following diagram illustrates a heap: 





a new value, we are not sure where the node should be placed. We do 
know that there will be one more node at depth d and the same number at level d. ‘Therefore, we add the 


at level d and percolate it up the tree until all heap properties are satisfied. If there is a node 
d it will be at level d-1 ), we add the new node as it’s child, otherwise 


When we wish tq add a node for 


new node 
with one child (it will be only ene an 
we add it to any leaf node at level d-1. 


The following procedure SiftUp illustrates this operation: 


PROCEDURE SiftUp(node) 


BEGIN 
IF value(node) > value(parent(node)) 
THEN BEGIN 
swap(node, parent(node)) 
Sif tUp(parent(node) ) £ 
END 
END ‘ 


The following sequence of diagrams illustrates the insertion process on a heap. The value "78" is to be 


added. 
P 
925 l 
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the two or more duplicates precede the other duplicates since they are duplicates. Ilowever, 
consider sorting a list of data which is already sorted and contains two or more duplicates. j 
‘The sorting algorithm, of course, yields a sorted list. If the algorithin makes no exchanges | 
among the duplicates in the sorted fist, the algorithm is considered stuble. Which Qf the z 
following sorting algorithms are stable? i 
a. Insertion sort : moe 
b. Exchange sort i 
c. Selection sort 
d. Merge sort i 
c. Quicksort 2 
/ 
į i [| 
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= 


‘The following procedure SifiDown illustrates the process: 


PROCEDURE Sif tDown(node) 


BEGIN ; 
IF (value( lef tchild(node) ) > value(node)) OR 


(value(rightchild(node) ) > value(node)) 


THEN BEGIN 
swap(value(node), value(largernode) ) 
SiftDown(large rnode) 
END 


END 


` The following sequence of diagrams illustrates the root removal process on a heap: 
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5. The worst cases that could be presented to the straight insertion sort would be 


I. a list in the correct order 
Il. a list in reverse order 
Ill. a list in random order 


(A) I 

(B) II 

(C) HI 

(D IAI 
(E) I&I 


6. The order of magnitude of the average soi (number of compares) for 
the bubble sort on a list of length N is i 
{A) N log N 
(B) N squared 
(C) N/2 
íD) Log N 
(E) N 


Questions 7—9 are based on the following program segment: 


for I := 1 to N — 1 do 3 
begin - 

Value := All); 

Position := I; - 

for] := I+ a a 
if A[/] < Value then 


begin 
1 Value := Alj|; 
Position := J ee 
end; ` ae eo: 
A[Position] := A{I]; 
All] := Value 
end; 


7. This program sorts A into ascending order by means of: 
(A) bubble sort 
(B) selection sort 
(C) insertion sort 
(D) heapsort 
(E) quicksort 


8. The segment that exchanges the smallest item with the first item is: ; 
(A) Value := A|) à ; 
Position = J 
(B) A[Position] := A|I] 
All] := Value 
(C) for J := I + 1 to N do 
if Alj] < X 


t 
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10.7.2.2 deleteroot 


In the worst case, the value exchanged with the root must be sifted down to a leaf node upon every. 
n nodes. In the best case, it requires no 


deletion, thereby requiring O(log(n)) comparisons for a heap of 
siftdowns and only one comparison. ‘Therefore, removing all n values from the heap requires O(n “log(n)) 
comparisons in the worst case and O(n) in the best case. ‘Though we do not show it here, heap requires 


O(n*log(n)) in the average case also. 


10.8 A Lower Bound on Comparison Based Sorting 


We have looked at the problem of sorting n clements drawn from a totally ordered set. Since nothing 
celse was known about the elements, we used comparisons to gain information about the problem. Hence, 
all six of these sorting algorithms are comparison based sorting algorithms. In analyzing these algorithms, 
we found that some take O(n*log(n)) comparisons, some take O(n*log(n)) comparisons in the average 
case, some take O(n?) in all cases, cte. What is the fewest number of comparisons required? 


Consider the problem of sorting three elements named a, b, and c. ‘The following decision tree depicts 
the solution. The comparisons are indicated with circles which contain the two elements being compared. 


f The outcome of each compare is indicated along the arcs of the tree. 


asb 
b < c <b 


ef =e na e <b 
goo N CA 


Figure 10-1: Decision Tree For Sorting Three Elements 


Oee 


Each leaf node (indicated by a box) corresponds to a possible permutation of the elements. For n 
elements there are n! (n factorial) permutations. ‘Therefore, the decision tree has n! leaf nodes. Recall 
that a binary tree of height h has at most 2” Jeaves. Therefore, a binary tree with n! leaves has at least a 
height of log,(n!). Since each level of the tree has a comparison. all comparison based sorts on some 
input require O(log(u!)) comparisons. From Stirling's formula we know n! = (n/e)". ‘Pherefore, log(n!) 





- = n(log(n) - log(e)) = ‘n*log(n) - L44n. ‘Thus, we may state the following fact: All comparison based 
g sorting algorithms must sometimes require O(u*log(n)) comparisons. 

3 “Kile Date; January Ist 1985 at 17:22 Scribe Date: January 2nd 1985 at 00:36 
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Questions 14-16 refer to the following tree. 


14. 


15: 


17. 


18. 





The preorder traversal of this tree will output: 

(A) ABHICDFGE i 
(B HIBAFDGCE 

(C) IHBFGDECA 

(D) ABCDEFGHI 

E) IHGFEDCBA 


The inorder traversal of this tree will output: 
(A) ABHICDFGE 
(B) HIBAFDGCE 
(C})IHBEFGDECA 
D ABCDEFGHI 
E) IHGFEDCBA 


The postorder traversal of this tree will output 
(A) ABHICDFGE 
(B HIBAFDGCE 
(C) IHBFGDECA 
D) ABCDEFGHI 


-(E) IHGFEDCBA 


A strictly binary tree with n leaves contains 
(A) n nodes 

(B) 2"! nodes 

(C) n? nodes 

(D) 2" nodes 

[E] n logan nodes 


A binary tree can represent an expression containing operands and binary 
operators. The root of such a tree contains an operator applied to the results 
of evaluating the left and right subtree. Given the operator ^ (and) V (or), the 
. following tree represents the expression: 


i 

. 
3 [i 
f 
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466 
Algorithm - Average “Worst Best 
insertion n? n? n 

_ exchange(bubble) n? n? n? 
selection =~ n? ige n? 
merge n*log(n)  n*log(n) n*log(n) 
QuickSort n*log(n) n? n*log(n) 
Heapsort n*log(n) n*log(n) n 


Radix n n n 


Table 10-1: Comparison of Sorting Algorithms 


Given these different sorting algorithms, which-one is the best? ‘The answer depends on the particular 
sorting problem. None of these algorithms is always the best nor always the worst. ‘The performance of 
each depends on the circumstances of its usc. For small sorts (small valuc of n), n? algorithms do as well 
as or better than most n*log(n) algorithms. Performance also depends on the degree to which the data is 
already sorted. Some algorithins do not use partial sorting; others do. You should try to find out as much 
as possible about the data to be sorted before selecting a sorting algorithm. If you know absolutely 
nothing about the data you will be sorting, use what you consider to be the simplest sort, probably the 


bubble sort. 


Since sorting is one of the fundamental problems in computer science, it has received much attention. 
‘The definitive work on the subject is Knuth Volume HI (8]. However, it is somewhat difficult to read. 
Another useful source is Design and Analysis of Computer Algorithms by Aho, Hopcroft and Uhiman [1]. 


10.11 Exercises 


1. Name a set and a relation which satisfy reflexivity and antisymmetry, but not transitivity. 
2. Name a set and a relation which satisfy reflexivity and transitivity, but not antisymmetry. 
3. Name a set and a relation which satisfy antisymmetry and transitivity, but not reflexivity. 


4. Write Pascal programs which implement the algorithms for the following: 


a. Insertion sort 
b. Exchange sort 
c. Selection sort 
d. Merge sort 
c. Quicksort 


f. Bin sort 


5. Modify all the above programs so they sort in decreasing order rather than increasing order. 
Hint: Ifyou have them implemented well, the modifications are very simple. 


6. Oftentimes, data to be sorted contains duplicates. ‘Typically we are not interested in which of 
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` Directions: For each of the following choose tha best answer. =e P 


1. The technique of sorting lists i in main memory is euler” 
(A) merge sorting 
(B) external sorting 
(C) repetition sorting 
(D) internal sorting 
(E) sequential sorting 


2. The element on ae he sorting takes Piep i is called the. 
(A) pass ; 
(B) elernent 
(C) vector 
(D) key 
(E) order 


. 3. What name satveslioes the following sorting algorithm? 
—-start the index at one s 
—repeat remaining steps n — 1 times 

—locate the smallest of the remaining Ainea 
—swap with the indexed element k 
—increase the index by one 

(A) bubble 

(B) radix 

(C) selection 

(D) insertion 

(E) merge 


4. The worst case that could be presented to the quicksort would be 


I. alist in the correct order 
Ii. a list in reverse order 
Ill, alist in random order 


(A) I 

(B) I 

` (C) HI 

(D I&I 
(E) I&II 
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Multiple Choice Questions 


10. 


11. 


12. 


13. 


{D} Value := All] À 
Position := I 
(E) None of the above 


This program sorts lists of length: 


(A) N 
(B) N+1 
(C) N-1 
(D) 7 
(E) 7 


A sort which uses the binary tree concept such that any number is larger than 
all numbers in the subtree below it is called: 

(A) selection sort e eee 

(B) insertion sort 


. (C) heapsort 


(D) bubble sort 
(E) quicksort 


. Use the following algorithm to answer question 11. 


. Compare A[2} and A[1] 

. If A[2] < A[1] place A[2] before A[1] 

. Compare A[3] and A[2] i 

. If A[3] < A[2] compare A[3] with A[1] 

. If A[3] < A[1] place A[3] in position 1 and move adjacent items 


OR WN 


The algorithm above describes 
(A) selection sort 

(B) insertion sort 

(C) heapsort * 

(D) bubble sort 

(E) quicksort 


A sort which compares adjacent elements in a list and switches where neces- 
sary is a i 

(A) selection sort 

(B) insertion sort 

(C) heapsort 

(D) bubble sort 


| (E) quicksort 


A sort which iteratively passes through a list to exchange the first element 
with any element less than it and then repeats with a new first element is 
called 

(A) selection sort 

(B) insertion sort 

(C} heapsort. 

(D) bubble sort 

(E) quicksort 


aenn i: renee 


All] 
A(2| 
A[3] 
Al4] 
Als] 
A[6] 


Linear Linked Lists 


until the desired record is found or the end of the list (signified by Next = 0) is 
reached. Sorting a list involves changing index values. 

EXAMPLE: Assume the list in the preceding example has been sorted alpha- 
betically and the first index value is 4. The list 


All] 
A[2] 
A[3] 
Al4] 
A[5] 
A[6] 





might represent party-goers. Suppose, in ofder, JIM arrives, MARY leaves, VAL 
arrives, ANN arrives, and MARY returns. After each activity the list would appear 


as follows: 


FIRST INDEX = 4 FIRST INDEX =4 FIRSTINDEX =4_ FIRST INDEX = 5 








JIM arrives MARY leaves VAL arrives ANN arrives 


MARY returns = stack overflow 


The example illustrates one major constraint of the array representation of a 
linked list, namely the maximum array size must be declared in advance and 
cannot be adjusted during execution to reflect changes in the list. An array is a 
static structure; a linked list is a dynamic structure. The dynamic nature of a 
linked list is constrained by representing it as an array. 


2. Linked Lists in Pascal i 


Pascal implementations offer users a choice of array or dynamic representa- 


tion. The following example declares a list as an array of 50 records: 
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19. 


20. 





6 Multiple Choice Questions 


(A) (pAq) Ar 

(B) pva ^r 

(C) p V (qr) 

(D) rAq Vp 

(E) none of the above 


The maximum number of nodes at level three of a binary tree is 
(A) 8 : 

(B) 4 

(C) 9 

(D) 3 

(E) 6 


Sorts O(n log, n} are also O(n logion} 
(A) for binary sorts only 
(B) for quick sort only 

(C) under some conditions 

(D) always 

[E) under no conditions 


SÀ 


Linear Linked Lists 


will, respectively, allocate and free a node dynamically. No list of available nodes 
is required. Actual effects of a call to dispose(p) depend on each Pascal implemen- 
tation. References to the Info and Next fields of the Node(p) are given by p’.Info 
and p*.Next, respectively. nil is a reserved word denoting the nil pointer; it does 


not point to anything. 


3. Operations 


Standard operations with linked lists include: 


a) append/delete a note at the end of the list; 

b) insert/delete a node in a list (including at the front of a list); 
c) concatenate two lists; 

d) merge two ordered lists into one ordered list; 

e) free all nodes on a list; 

f) reverse a list; 

g) form a list which is the intersection/union of two lists; 
h} sort the list elements; 

i) search a list for a given element; 

j} return the number of nodes on a list; 

k} return the sum of integers in a list; 

l} move a node forward n positions in a list; 

m) copy a list; 

n) interchange the m" and the n™ nodes of a list. 


EXAMPLE: The following algorithm is designed to insert the integer 8 into 
the ordered list 2, 5, 7, 9, 11 and maintain the order in the list. The steps of the 
algorithm can be shown symbolically. 





© : 
Algorithm: Initialize p, q, r as pointers. Move q and p| = Next(q) ) down the list 
until p = nil or Info(p)>=8. If p = nil, then insert node at end of list, else insert 
node after node pointed to by q. 
The insert operation is executed in the order indicated in the “after” illustration. 
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The queue structure can also be represented as an array, but undesirable situa- 
tions can arise unless a wraparound feature is provided. That is, front and rear 
pointer values should be incremented modulo (mod) the array size. Deletion of a 
node is achieved by incrementing the front pointer; insertion by incrementing the 
rear pointer unless the queue is empty or full, respectively. ` 


7. Circular Lists 


A circular list is a linear linked list in which the Next field of the last node in 
the list is the address of the first node. Operations on circular lists are similar to 
those on linear linked lists. A circular list may have a listhead. It is possible to 
reach any node in a circular list from any other node. One use of circular lists 


LISTS 





CIRCULAR LISTS 


is to store elements of a sparse matrix in a more space-economical way than as an 
array. 5 i 


8. Doubly Linked Lists 


A linked list in which each node contains a pointer to its predecessor and a 
pointer to its successor is called a “doubly linked list”. List operations are similar 
to those in linear linked lists except that both Right and Left pointers must be 
changed. Also, only one pointer need be used to search for a list element. It is 
easier in a doubly linked list to reach any node from a given node than in a circular 
list, but extra space is required in each node for an additional pointer. 





LINEAR DOUBLY LINKED LIST 


CIRCULAR DOUBLY LINKED LIST 





i 
2 
i 





CHAPTER 4 ` 


Linear Linked Lists 


a 


1. Linear Linked Lists (Arrays) 


J In an array A, All] represents the value of the i® element and the next element 
is obtained by adding 1 to J. An array element may be a record consisting of a 
number of fields. In particular, one of those fields (call it Next) could contain a 
value of I and be interpreted as the index of (or pointer to) the next array element 
1 to be accessed. In this way, records could be stored within an array and linked so 
; -that they would not necessarily be contiguous. 
es EXAMPLE: Let each element (record) of an array A consist of two fields: 
| “Name, which contains a string up to four characters, and Next, which contains a 
legal value of the array index. Assume the first index value is 1 and that the next 
f index value is contained in the field Next, The value 0 signifies the end of the 
listing. 


Name Next 


A[1] 
A[2] 
A[3] 
A[4] 
A[5] 
A[6] 





The contents of the name field would be accessed in the order TOM, MARY, 
DICK, JOAN. TOM’s Next field points to the fifth element, MARY, and MARY’s 
Next points to the fourth element, DICK, etc. 

_ This is one way to represent a structure called a linear linked list, hereafter 
referred to as a “list” when the context is clear. The list structure provides a degree 
of flexibility not available in arrays. In particular, insertions and deletions of records 
can be accomplished by changing index (pointer) values in Next fields. No move- 
ment of records is required. Searching a list involves reading successive records 
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13-7- ype 
Node = record 
Car : string[ 10]; 
CodeNum : integer; 
end; 


Which of the following pointer type declarations is valid if placed after the type definition 
above? 3 
a. Pointer = “Car; c. Node = “Pointer; 

b. Pointer = “record; d. Pointer = “Node; 


13-8. What is the purpose of the caret (^) in the following? 


type | 
Pointer = “ReferenceType; 
a. The caret conveys to the compiler that Pointer is a pointer type and that variables of 
this type will contain data values of a ReferenceType variable. ; 
b. The caret conveys to the compiler that Pointer is a pointer type and that variables of 
-this type will contain an address of a ReferenceType variable. 
c. The caret indicates that ReferenceType is a dynamic data structure. 
d. The caret indicates that Pointer is a static data structure. 


13-9. What is the purpose of the caret (^) in A ^ Name assuming the following declarations? 


type > 
Name: string[10]; 
Node = “Name; 


var 
A, B : Node; ; 
a. Itrefers to the contents of Name which is pointed to by pointer A. 
b. Itrefers to the address of Name which is pointed to by pointer A. 
c. It conveys to the compiler that A is a pointer type and that variables of this type will 
contain data values of a Name variable. 
d. It indicates that A is a dynamic data structure. 


13-10. What is the function of procedure new? 
a. Itcreates new pointer types. 
b. Itcreates new variables of pointer type. 
c. It dynamically allocates memory space by giving a pointer type variable a location in 
memory to reference. 
d. It frees allocated memory space. 


Fe 


Use the following program declarations to answer questions 11-13: 


type eco 
Name : string[10]; 
Node = “Name; 


var 
A, B, C : Node; 


13-11. How would you assign a value to Name that is referenced by pointer A? 
a. Name. ^A := ‘Tom’; c. A.AName := ‘Harry’; 
b. A. Name := “'Dick’; d. A4.Name := ‘Joe’; 


oo enn enna an at mae Ce ie Ge s-on ee 
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const NumRec = 50; 
type RecPtr = 0. .NumRec; 
RecType = record 
Info : integer; 
Next : RecPtr 
: end; 
var a: array |1. .NumRec] of RecType; 
p : RecPtr; ; 


The Next field of the record with index p would be referenced as A[p].Next. 
The end of the list would be signified by A[p].Next:=0. When using this represen- 
tation, a list of available records should also be declared. 

In the representation of a list as a dynamic structure, each list element is 
called a “node” and consists of two parts, an Info field (which could be a record 
consisting of a number of fields) and a Next field. 


- NODE 


The Next field contains a pointer to the next list element. The pointer value is 
the address of the storage location of the next node. A general representation of a 
list is i: i 


. Info Next Info Next Info Next Info Next 


Z = 


where List is a pointer to the first list node and nil signifies the end of the list. If p 





LIST 


is a pointer to a node, the Info(p).and Next(p) are the Info and Next fields, respec- - 


tively, of the node pointed to by p. 
Pascal implementation of dynamic variables for linked lists involves declaring 
the type of a pointer and node as follows: : 


type NodePointer=*NodeType; 
NodeType = record 
Info : integer; 
Next : NodePointer 
end; 


where the Next field now is a pointer and contains the address of the next node in 
the list. The Info field could be declared as a record of more than one field rather 
than as an integer in the above representation. There is no need to declare an array 
because nodes are allocated and freed dynamically when this representation is 
used. If p is declared a variable of type NodePointer, then the statements 


new(p) and dispose(p) - 
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13-20. What change, if necessary, should you make to the following program declaration to 
facilitate the creation of a linked list? 


oa = ^Node; 
Node = record 
Name : string[10]; 
Age : integer; 
end; (* Node *) 


Link : Pointer; should be added as a field to the record Node. 
Link : Pointer; should be added after end. 

Pointer = “Node; should be placed after end. 

No change is needed. 


Refer to the following case and type definition to answer questions 21-23. 

Assume you have a linked list of three nodes, where the fruit field of each node has the 
values ‘Apple’, '‘Grape', and 'Pear', which are referenced by A, B, and C respectively. 
The type definitions are as follows: 


pe 
Pointer = ^Node; 
Node = record is Se 
Fruit : string[10]; ; er 
CodeNum : integer; 
Link : pointer; 
end; (* Node *) 


Ss a e ae ee e 


13-21. Which of the following statements would delete the node 'Grape'? 


a. A“.Link := nil; c. A^.Link := B “.Link; 

b. BALink := nil; d. B4.Link := C4.Link; 
13-22. After deleting the node ‘Grape’, what statement would free the storage that it is 

occupying? 

a. delete (B); c. dispose (‘Grape’); 

b. delete (‘Grape’); : d. dispose (B); i 

| 

13-23. Which of the following statements would insert a new node after node 'Pear'? ; 

a. new (C“.Link); c. new (“.Link); 

b. new (B4.Link; . d. new (Pear’.Link); 


13-24. When a variable is created by the mew procedure, it is stored on a structure called a 


a. queue =C. stack 
b. tree 3 : d. heap 
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4. Listheads 


A separately defined node called a “listhead” can be added to the front of a 
linked list. The external pointer List would then point to the listhead and the Next 
field of the listhead would be a pointer to the first node of the list, or nil if the list 
is empty. The Info field of the listhead could contain information about the list 
and be a different kind of record than the Info field of nodes on the list. For 
example, a listhead’s Info field could contain: 


a) the number of nodes on the list; 
b) the type of elements in the list; 


c) the stock or identification number of a major part for which the listele- 
ments are components; 


d) a pointer to the last node in the list; or 
e) a pointer to a record containing information about the list. 


When a listhead is used, its structure must be defined in a Pascal declaration and 
linked list algorithms must be changed to account for its presence. 


5. Stacks 


A linked list in which insertions and deletions are allowed only at the front of 
the list is called a “stack.” One example of a stack is the list of available nodes 
used in implementing insertions and deletions in linked lists. Stacks are primarily 
used as auxiliary structures in implementations of compilers, recursion, sorting 
and searching algorithms, and dynamic structures. 

Because stack operations are so heavily constrained, the stack structure can 
have a sequential representation as an array. A stack pointer, then, takes values 
which are array indices and always points to the top element of the stack. Deletion 
of a node (a “pop” operation) is achieved by decrementing the stack pointer unless 
the stack is empty; insertion (a “push” operation) by incrementing the stack 
pointer provided there is space in the array and assigning the value of the element 
to the array location pointed to by the stack pointer. 


6. Queues 


A linked list in which insertions are made only at the rear and deletions are 
made only at the front is called a “queue.” Implementations of a queue require 
two pointers, one to the front node and one to the rear node of the list. Queues are 
commonly used in simulations and in operating systems. ; 
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13-33. Which is a frequently used data structure for storing and retrieving information? 
a. stack c. tree 
b. queue ; . d. linked list 


13-34. What is the difference between an ordered tree and a nonordered tree? : 
a. An ordered tree can have no more than two subtrees, whereas, a nonordered tree can _ 
have an unlimited number. 
b. Inan ordered tree, the positions of the subtrees are important, whereas, the relative 
positions of the subtrees in a nonordered tree are not significant. 
c. An ordered tree is a linear data structure while a nonordered tree is not. 
d. A nonordered tree is a multiply linked list while an ordered tree is not. 


Use the following diagram of a tree to answer questions 35-37. 
Sias ee 
B Cc 
GEN AERAR 
D E F G 
Pa p I 


13-35. What would be the result of an inorder traversal of the tree above? 


a. HDIBEAFCG c. HIDEBFGCA 

b. ABDHIECFG d ABCDEFGHI 
13-36. What would be the result of a preorder traversal of the tree above? 

a. ABCDEFGHI - c ABDHIECFG. 

b HDIBEAFCG : d. HIDEBFGCA 
13-37. What would be the result of a postorder traversal of the tree above? 

a. ABCDEFGHI c. ABDHIECFG 

b. HDIBEAFCG d. HIDEBFGCA 


13-38. All of the following are major characteristics of a binary tree, except 
A node can only have two children. 

Each node can have no more than two siblings. 

Binary trees can be empty, that is, have no nodes. 

Binary trees are always ordered. 


aoo 


À” 12 
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Chapter 13 


Dynamic Data Structures 


MULTIPLE CHOICE 


‘13-1. 


13-2. 


13-3. - 


13-4. 


13-5. 


13-6. 


noo 


A static data structure is a structure which À 

a. is declared at the beginning of a program or a block and remains in existence 
throughout the execution of the block in which it is declared 

b. changes in size as a program executes and allows storage locations to be set aside as 
required and then freed up when no longer needed 

c. is only declared at the beginning of a program 

d. is only declared in the main program 


A dynamic data structure is a structure which , 

a. is declared at the beginning of a program or a block and remains in existence 
throughout the execution of the block in which it is declared 

b. changes in size as a program executes and allows storage locations to be set aside as 
required and then freed up when no longer needed 


‘ec: is only declared at the beginning of a program 


d. continually changes its data type throughout the program 


“Which of the following is an example of a static data structure? 


a.. stack- . ; c. tree 

b. record d. linked list 

Which of the following is an example of a dynamic data structure? 

a. record c. queue 

b. array d. set 

A dynamic data structure is a collection of elements called that can grow or 
shrink, depending on how many elements the structure contains at any given time. 
a. records c. strings 

b. nodes d. lists 


What is characteristic about dynamic data structures in memory? 

The elements are not stored consecutively in main memory. 

The elements are only stored in secondary storage. 

Storage space is set aside throughout the execution of the whole program. 
Elements in memory cannot be accessed while the program is being executed. 
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13-41. Assume you need to declare a node that could be used to implement a binary tree I 
containing student information consisting of name and Social Security number. Which 
of the following declarations is correct? | 
a. type c. type 
Branch = “Tree; Branch = “Tree; 
Tree = record ' Tree = record 
Name : string[20]; Name : string[20]; | 
SSN : string[11]; SSN `: string[11]; 
Left, Right : Branch; Left, Right : “Trees; 
end; end; i 
var var 
Root : Branch; Root : Branch; 
b. type d. type . 1 
Branch = “Tree; Tree = record 
Tree =record ` Name : string[20]; 
Name : string[20]; SSN : string[11]; fj 
SSN : string{11]; Left, Right : Branch; 
end; end; 
var Branch = “Tree; i 
Root : Branch; var 
Root : Branch; 
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13-14. 


13-15. 


13-16. 


13-17. 


13-18. 


13-19. 
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Assume that the value ‘Jane’ has been assigned to Name which is referenced by A. 
What is the effect of the statement B := A;? 

The value ‘Jane’ is copied into B. 

The memory address stored in A is copied into B. 

The memory address stored on B is copied into A. 

ek A is now pointing to a different node, i.e., to a node which contains no 
value. 


ao 


Assuming that the value 'Peter' is assigned to Name which is referenced by B, what is 
the effect of the statements 

new (C); 

CA 15:3"; 

The memory address stored in B is copied into C. 

The memory address stored in C is copied into B. 

A newly-created node pointed to by C will contain the value ‘Peter’, 

An error message will be generated. 


oO Fe 


Which of the following statements about pointers is true? 

a. The value of a pointer variable is the address of a storage location. 

b. Pointers cannot be involved in expressions using relational operators. 

c. Expressions using arithmetic operators can be used with pointers. 

d. The value of a pointer variable is the value contained in the node it is pointing to. 


A linked list is a i 

a. simple form of static data structure 

b. group of nodes that are connected together 

c. group of variables of the same type that are connected together 
d. compiled list of variable names 


Linked lists are also called data structures. : À 
.a. straight - c. horizontal 
b. linear ; d. vertical 


What is the major advantage of linked lists over arrays? 
a. Itis always kept in secondary storage. 

b. Itis faster. 

c. Itis easier to modify. : 

d. The code is easier to write and understand. 


The end of a linked list is indicated by having ; 
a. a boolean value of ‘false’ c. the pointer field point to nil 
b. the pointer field point to zero d. the-pointer field undefined 


What is a major disadvantage of the linked list? 

a. To access a node of a list, you have to traverse the list till the desired node is 
- reached.” 

b. It is difficult to add or delete nodes from a linked list. 

c. Ituses large amounts of memory space. 

d. There is a limit to the number of nodes a linked list can contain. 


yt 


A stack is an area of memory created when the program execution 
begins. 

True 

False 


Once the terminating case of a recursive procedure is met, 
the solution is found 

the preceding call can be completed 

the procedure exits to the call instruction 

the part of the stack used for the procedure is empty 


The QuickSort should be used to sort all lists. 
True ` 
False 


The focus of the QuickSort is on ‘ 
recursion 

partitioning 

the first component value 

the left and right lists 


The process of partitioning stops vheti ° 
the sublist has the length of one 

the sublist has the length of zero 

the left sublist has the length of one 

the right sublist has the length of zero 


A Pascal reserved word that is used to notify the compiler 
that the procedure (or function) will be defined later in 
the program is Ste 

LATER : 
FORWARD 

MARK 

RELEASE 





A BOOLEAN expression that is used to halt a recursive 
procedure call is . 

HALT ; 

EXIT 

the terminating case 

the beginning case 





An array is a data structure. 
static 

dynamic 

simple 

heap 
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What is the difference between procedures dispose and release? 

a. It is impossible to dispose of a node in the middle of a heap using dispose. In this 
case, release should be used. : 

b. When dispose is used, the node pointed to by the pointer variable in the dispose 
statement is "freed up"; when release is used, all the nodes above the pointer 
variable are released. 

c. When dispose is used, the node pointed to by the pointer variable in the dispose 
statement is "freed up"; when release is used, all the nodes below the pointer 
variable are released. 

d. Dispose works faster than release. 


Which of the following statements about stacks is false? 
a. Astack isa list in which all insertions and deletions take place at one end, called the 


top. 

b. Stacks are said to be "Last In, First Out" lists. 2 

c. A stack is a list in which all insertions take place at one end, called the rear, and all 
deletions take place at the other end, called the front. 

d. The elements of a stack can be of any kind of data, for example, records, arrays, 
linked lists. 


When an element is removed from a stack, that element is said to be off. 


a. pushed c. popped 
. b. pulled d. picked 
When an element is added to a stack, it is said to be on. 
a. popped í c. pushed 
b. added d. placed 


What is a queue? 

a. Itis a listin which all insertions and deletions take place at one end, called the top. 

b. It isa list in which all insertions take place at one end, called the rear, and all 
deletions take place at the other end, called the front. 

c: ` Itis a LIFO list. ; 

d. Itis a doubly-linked list. ; 


A tree can be defined as a ; ; 
a. nonlinear data structure with a root that points to zero or more subtrees 


_b. linear data structure composed of individual nodes that have more than one link 


c. linear structure composed of branches, leaves, and roots 
d. complex multiply linked list 


Which of the following data structures is the most suitable for the task of reversing the 
order of a list? 

a. queue c. linked list 

b. tree d. stack 


The nodes at the bottom of the tree, which have no downward branches and whose 
pointers are nil, are called 

a. siblings c. roots 

b. descendants d. leaf nodes 





y-\l 


22, Once NEW is executed, the value of MemLoc is ‘ 
a; 4 

b. NIL 

c. the address of a memory location 

d. an INTEGER 


23. MemLoc* has data type a 
a, pointer 

Da INTEGER 

c. dynamic 

d. static 


24. After the instruction READ (MemLoc*) is executed and the value 
entered from the keyboard is a positive integer, ° 
a. a run time error message Occurs, since you can't do T/O on 


pointers 
b, a compiler error message occurs 
c. the integer value is assigned to MemLoc 
. none of the above 


Use the following declarations to answer questions 25 - 26. 


TYPE Rec = RECORD 
name : STRING[40]; 
age : INTEGER 
END; 
RecMemLoc = “Rec; 


VAR Person : RecMemLoc 


25. To assign a value to the name field of the memory location 


referenced by Person, could be used. 
n S 
a. Person*.Name := ‘Juan 
b. Person.Name := 'Waerwic' 


'Morganna' 
"Matt' 


c.e Person*.Name* 
d. Person.Name* : 


26. To assign a value to the variable Person, could be 
a. Person* := 1000 

b. Person := 1000 

c. NEW (Person) 

d. Person.Name* := NIL 


27. Values stored in the memory addresses referenced by variables 
of type pointer are stored in the area of memory. 

a. stack 

b. heap 

c. - ROM 

d. dynamic 
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13-39. What is the output produced by the following code (assuming it is embedded correctly in 
a program)? All the pointers are of type ^integer. 


new (P1); 

new (P2); f 

P14 := 10; * 
P24 := 20; 

writeln(P14, na 

PES; 

writeln(P1^, Ey: 

Pins 


write EA P24); 


P24 := 40; 

writeln(P1^, Fo); 

a. 1020 c. 1020: 
2020 — 20 20 
30 20 30 30 
30 40 40 40 

b. 1020 d. 1020 

=- 2020 10 10 
30 30 30 10 
40 30 30 40 


13-40. What is the output produced by the following code (assuming it is embedded correctly in 
a program)? All pointers are of type “integer. 


PIA sx 10: 
P24 := 20; 
wniteln(P i”, P2} 
PR 7: 
writeln(P14, P24% 
P14 := 30; 
writeln(P14, r2*); 
P24 := 40; 
writeln(P14, P24); 
a. 1020 c. 1020 
20 20 20 20 
30 20 30 30 
30 40 40 40 
b. 1020 : d. 1020 
20 20 10 10 
30 30 30 10 
40 30 30 40 


. 
o 
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Chapter 10 TEST 


Multiple choice questions. Select the best answer to 
each question. : 


l. 


ae 
b. 
Ce 
d. 


2a 
a. 
b. 
Ce 
d. 


3. 
Be 
b. 
Ce 
d. 


The requirement of a recursive definition is to define a(n) 
case. 

beginning 

ending 

special 

independent 


A process is recursive if it repeats itself until $ 
some preset condition is met 

the user halts execution 

the programmer uses EXIT 

none of the above 


The power of recursion is in its ma 

time efficiency 

memory efficiency 

stack efficiency 

ability to, sometimes, give a more readable and straitforward 
solution to a problem 


Any time a call is made to a procedure or function, the value 
parameters and variables are placed on a _ . 

table 
list 
stack 
heap 





A recursive call will terminate when f 
the loop control variable has reached its limit 
the ending case is met 

the solution is found 

a flag occurs 


The purpose of using a recursive solution is to > 
improve time efficiency 

improve design time 

reduce a problem to a simple solution 

confuse the programmer 


Recursion is used ‘in the sort. 
Shell 
insertion 


selection 
Quick 
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76 Multiple Choice Questions 
var : 
Next, Back, This : Ptr; 
First, Second, Third : Pointer; 
Student : node; 
Answer : Item; 
Flag, Switch : boolean; 
Index : integer; 
procedure GetString (var Thing : String; Len : integer); 
begin 
(* assume a procedure that gets a string of some length*) 
end; (* getstring *) : 


4. Which of the following is a legal type declaration statement? 
(A) Pointer : ‘real; 
(B) Ptr* := NumberPointer;. 
(C) Pointer := nil; 
(D) IntPtr = “integer; 
(E) Petr*.Link := nil; 


5. Which of the following is a legal assignment statement? 
(A) Ptr* := Student; 
(B) Next*.Link := Student; 
(C) Next* := Student; 
(D) Next := Ptr; 
{E) Back*.Link := Index; 


Use the following program segment and the previous definitions and declarations 
to answer questions 6, 7, 8, and 9. 


new (This); 

GetString (This*.Name, 10); 
new (Next); — 

GetString (Next’.Name, 10); 
This*.Link := Next; 
Next*.Link := nil; 


6. What is the value of the pointer in the second node in the list? 
(A) nil 
(B) undefined 
{C} an address 
(D) Next 
(E) This 


7. Which of the following will result in the entire list becoming a circular 
structure? 
(A) This*.Link := This; 
(B) This := nil; 
(C) Back := This*.Link; 
(D) This := Next*.Link; 
(E) Next*.Link := This; 


a D i 





E 16. A Linked Fi t iona data structure. 
a. Static 
b. dynamic 
Ce simple 
d. heap 


17. A variable whose type is is used to store a memory 
‘location. ; 

a. Stage. 

b.e heap 

ce. pointer 

d. INTEGER 


Use the following declaration to answer questions 18 - 19. 
VAR MemLoc : “INTEGER 


18. MemLoc has data type s 
a. INTEGER 

| b. pointer 
C. dynamic 

7 de stack 


19. To initialize the variable MemLoc, the procedure ~ can be 
used. : 

a. NIL 

b. NEW 

ce. MARK 

d. RELEASE 


20. The value can be assigned to any pointer. 
Qe NIL j 
b. NEW 
Z MARK 
RELEASE 


Use the following to answer questions 74. 2h 


VAR MemLoc : “INTEGER; 
BEGIN 
NEW (MemLoc); 
7 READ (MemLoc”“); 
i IF MemLoc^ >= 0 
THEN WRITE (MemLoc”) 
ELSE MemLock := NIL 


i 21. Pii to executing the instruction NEW, the value of MemLoc 
is ° 
E 0 
b. undefined 
Pee 
| d. infinite 
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78 Multiple Choice Questions 


First*.Info := 5; 
writeln (Second”.Info); 
(A) 15 
(B) 50 
(C) 0 
(D) 30 
(E) 10 


13. What is printed by the following program segment? f 
new (First); 
new (Second); i} 
First’.Info:= 10; . 
Second’ Info := 15;. 
First := Second; 
First’.Info := 5; 
writeln (First’.Info, Second’.Info); 
(A) 10 5 : i} 
(B) 10 15 
(Cc) 5 5 
(DE 5238 ; ns , i 2 il 
{E) $245 
14. What is printed by the following program segment? ` ee if 
new (First); j 
First’. Info := 25; 
new (Second); fl 
Second’. Info := 50; 
new (First); i j 
First’.Info := Second*.Info p 
Second’. Info := 30 
writeln (First*.Info, Second’.Info); 
(A) 50 25 
(B) 25 50 i 
(C}) 50 50 
(D) 50 30 
(E) 25 30 | 


Use the following information to answer questions 15-18. 
Given the following procédure headings: 


procedure Insert (x : integer), 


(* inserts x at appropriate place in list ‘I 
procedure Delete (var x : integer); 


(* deletes appropriate value returns as x *4 


And the following program fragment: 





28. 
ae 
b. 
Ce 
de 


29. 


ae 
b. 


30. 


Be 
b. 
Ce 
d. 


31. 


ae 
b. 


32; 
ae 


b. 
Ce 
d. 


ee 


b. 
34. 


ae 
b. 


35. 


ae 
b. 


36. 
ae 
b. 


d. 


A- linked list is a data structure. 
simple 

static 

dynamic 

heaped 


One or more of the fields of a record in a linked list are of 
type pointer. 

True 

False 


A linked list builds a list of records with each record linked 
to the next by a(n) x 

index 

record number 

field of the record 

array of the record 








Without knowledge of where the first record is located in 
memory (in a linked list), the entire list is lost. 

True 

False 


A header record is e 

a record in a linked list that contains the address of the 
"head" of the linked list 

useless 

hard to create 

necessary for program execution 


A major advantage of using a header record occurs when 
inserting a new first record. 

Teus ae 

False 


One of the major advantages of creating a linked list is to 
conserve memory. 


True 
False 


The procedure MARK marks the stack area of memory used for 
a linked list. 

True 

False 


A linked list is a data structure. 
nonlinear 

tree 

linear 

parallel 


{ 
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Directions: For each of the following, select the best answer. 


1. An inventory control system uses a last-in-first-out system. The best data 
structure to represent that system is 
(A) queue 
(B) stack | 
(C) array 
(D) circular linked list 
(E) subscripted variables 


2. A telephone system saving calls to a particular number can best be repre- 
sented by 
{A) queues 
(B) stacks 
(C) circular linked lists 
- (D) subscripted variables 
(E) none of the above 


3. A waiting line for “Travel’s Bus” company can best be represented by 
(A) queue 
(B) stacks 
(C) circular linked list 
(D) array 
(E) subscripted variables 


Use the following declarations to answer questions 4-14. 


type ; 
Ptr = “Node; 
String = packed array|1 . . 10] of char; 
Pointer = “Item; 
Node = record 
Name : String; 
Link : Ptr; 
end; (* node *) 
- Item = record 
Info : integer; 
Connection : Pointer; 
end; (* item `) 


aral 
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Set #4: Linear Linked Lists 


Which of the following will result in an inability to reach the first node? 
(A) Next*.Link := This*.Link; 

(B) Next := nil; 

(C) This := This*.Link; 

(D) This*.Link := nil; 

(E) Next := This; 


What is the result of the following two lines? 


new (Back); 
Back*.Name := Next*.Link.Name; 


I. They create a new node referenced by Back. 
Il. They assign a value to the name field of new node. 
Ill. They create an undefined variable. 


(A) I&I 
(B) I & Ill 

(C) 1 & 

(D) I 

(E) Il 


Use the situation created by the program segment below in addition to the previ- 
ous definitions and declarations to answer questions 10 & 11. 


10. 


ii; 


12. 


new (Back); 
GetString (Back*.Name, 10); 


: Which of the following will connect the new record structure at the end of 


the list? 
(A) This*.Link := Back; 
(B) Back*.Link := nil; 
(C) Next := Back; 

(D) Next’.Link : 
(E) Back*.Link : 


Back; 
Next; 


Which of the following will connect the new record structure at the begin- 
ning of the list? 

(A) Back*.Link := This; i 

(B) Back := This; 

(C) This*.Link := Next; 

(D) Next := This; 

(E) Next*.Link := This; 


What is printed by the following program segment? 
new (First); 
First*.Info := 10; 
new (Second); 
Second’.Info := First*.Info * 3; 
new (First); 


77 





e N 


4 : 2 Computer Science: The First Course 


1.1.1 Charles Babbage 


The story of the modern computer: begins with a terribly eccentric genius from England, Charles 
Babbage, who lived from 1791 to 1871. Babbage appears to have foreseen the ideas that lic at the heart of 
the shopping mall computer that we know today. In his day, it was possible to construct machines to 
perform predetermined calculations. His new idea was to build machines -to perform a specified 
calculation based on the sequential execution ofa set of instructions. 


The importance of this insight cannot be overstated. Whereas the machines that Leibniz and Pascal 
had invented performed some basic operations of arithmetic, Babbage realized that a machine capable of 


performing an arbitrary sequence of suclr operations would be a much more powerful tool. “The Pascal ° 


machine could perform arithmetic; the Babbage machine used arithmetic in computing trigonometry, 
geometry, and even simple mechanics! 


Babbage actually planned and worked on several different classcs of computing machines. ‘The first 
was his Difference Ingine, which worked on the principle that the value of an expression of the form N 
+ N + C could be computed with no multiplications, by simply adding constants [7]. In the expression 
N2 + N + C, the letter N is a symbol for a whole number, and the letter C is the syinboi for some 
arbitrary constant. The following table illustrates how this technique works if we use 41 for the constant. 
The expression becomes N? +N +41. ; 


N24N+4]1 


ie) 
AS) 


Puno jz 
Sas 

OoOnALN, 

NNN 


The first column indicates values for N. The second shows the derived valucs for the expression 
N2-+N44L. The entries in the third column, D,. are the differences between the values for consecutive 
entrics in the second column, N24+N-441. For example, 43-41=2, and 47-43=4. The entries in the 
fourth column, labeled D are the differences between consecutive entrics in the third column. For 
example, 4-2=2, and 6-4=2. Here all entries of D, have the value of 2. 


Notice in the table the values for N24+N-+4], D, and D, for the case where N=2. These values (47, 4, 
2) added equal 53. Notice that the value for N2+N+41 is also 53 for the case where N=3. Thus it 
appears that the next value for the expression N2+N +41 can be derived by adding the current values of 
each case for N. Take, for example, the case where N=5. We know it will be the previous entry (61) plus 
the adjacent entry in column D.: We also know that the adjacent entry in D, will be the previous entry in 
1), (8) plus the adjacent entry of D,. Recall that all enuies of D, are 2. ‘Therefore, the adjacent entry of 
D, will be 8 + 2or 10. Likewise, the desired entry of the N2+N-+41 column will be 61 + 10 or 71. One 
can easily verify that the computed value is correct. 


Note that the value of N24+N+41 for N=5 was computed without multiplications by adding 


` 
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15. 


16. 


17. 
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Insert (A); 
Insert (B); 
Insert (C); 
Delete (D); 
‘Delete (E): 


If the data structure is a stack, what is the value of D? 
(A) 10 

(B) 15 

(C) 30 

(D) 25 

(E) 40 


If the data structure is a stack, what is the value of E? 
(A) 10 
(B) 15 
(C) 30 
(D) 25 
(E) 40 


If the data structure is a queue, what is the value of D? 
(A) 10 , 

it} Be 

(C) 30 


AD 25 . 


(E) 40 


If the data structure is a queue, what is the value of E? 
(A) 10 ee 
(B) 15 
(C) 30 
(D) 25 
{E) 40 


To answer questions 19 and 20, choose from the following advantages of various 
data structures. 


19. 


I. Conserves storage space. 
Il. Provides faster access to a specific element. 
Ill. Allows for faster insertion and deletion at the center of the list. 


Given a set of records reflecting the inpatient census of a large metropolitan 
hospital, a programmer would choose a linked list via pointers over an array 
for the following reason(s): 

(A) I&II 

(B) IAH 

(C) II only 

(D) 11 & Il 

E | Tail 
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1.1.2 American Machines 


Some have suggested that Babbage did not receive support from the Royal Society of Science in 
building his Analytical Engine because the ‘need for computing was not widely recognized. However, 
toward the end of the nineteenth century, a number of scientists recognized the need for rapid, accurate 
computations and attempted to satisfy that need. Much of this activity took place in America. 


Albert A. Michelson was a leading physicists of the nineteenth century. His work had a decidedly 
empirical! bent to it, and his calculations were so exact that he was able to correctly determine the velocity 
of light. In 1907, he was awarded the Nobel Prize for his methods for exact measurement. He was 
America’s first Nobel Laureate in physics. Although Michelson accomplished many things -only 
peripherally related to computing [9], his major contribution to the field was the design and construction 
of a machine capable of computing the sum of a Fourier series? to a great degree of accuracy. ` 


Whereas the developments discussed thus far arose from science’s need for computing, the next arose 
from a much more practical need. John Billings, who was in charge of the work on vital statistics for both 
the tenth and cleventh U.S. census (1880 and 1890 respectively), was directly responsible for collecting 
and tabulating the census data. As the eleventh census approached, the results of the tenth were still 


unfinished. 


An employce of the census office, Herman Hollerith, had designed an automatic tabulating system as 
part of his doctoral dissertation. One account states that Billings suggested to Hollerith that hand tallying 
of statistics could be replaced by a machine similar to the Jacquard Loom. Hollerith invented the punch 
card to represent data gathered for the census. The cards were the size of a dollar bill and had holes 


punched in them to répresent the census items such as sex, age, occupation, and education. It greatly ` 


reduced tabulation times. By the time the eleventh census came around, Hollerith’s system was so 
improved that he was able to determine the population of the entire country within one month after the 


return of all census data. 


In 1896, Hollerith started the Tabulating Machine Company to make cards and the machines to process 
them. It was immediately very successful. The company went through several name changes until 1924, 
when it was renamed the International Business Machines Corporation (IBM), the name by which it is 
known today. - 


Harvard Mark I 


In 1937, Professor Howard Aiken of Harvard University proposed a new type of high-speed automatic 
calculating machine controlled by a coded sequence of instructions. In 1939, under the dircction of 
President Thomas J. Watson, Sr., 1BM Corporation decided to design and build the machine. Professor 
Aiken and several IBM engincers established that the machine's requirements could be met through a 
combination of components from IBM’s standard line of calculators and a number of specially-made 


mechanisms. 


‘The machine consisted of 78 adding machines linked together. Containing nearly 750,000 parts and 
more than 500 miles of wire, the massive machine was 51 fect long, 9 feet high and weighed S tons. 
Completed in 1943 and installed at Harvard in carly 1944, the Automatic Sequence Controlled 
Calculator, named the "Mark I" at Harvard, brought Babbage’s principles of the Analytical Engine to full 
realization. 





‘A Fourier series is an infinite sum, cach term of which is cither a sine or cosine. 
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Computer Science: The First Course 


‘Though computers with a limited ability to speak are commonly used in arcade games and automobiles, 
it is still impractical to use computers conversationally. A computer that can speak and hear a vocabulary 
of 20,000 words is possible, but not practical for the ime being. By comparison, Ure English language has 
on the order of 1,000,000 words, but no one person possesses more than 100,000. 


Given examples such as these, we might be tempted to think that today’s practical limit to computing is 
tomorrow's off-the-shelf technology. But the theory of computing has shown that there is a real limit to 
what can be computed! Enough mathematics was known prior to construction of the ENIAC and the 
Harvard Mark | to demonstrate the limits of computation. In the carly 1900's, mathematicians’ from 
several apparently divergent arcas of the field had come to the same conclusion. 


David Hilbert 


‘Around the turn of the century, it was believed that mathematics, and possibly formal science, could 
answer its questions unambiguously. ‘This meant that any mathematically formulated statement could be 
proved to be cither truc or false. David Hilbert stated the position in his famous address of 1900 [10]: 


As an example of the way in which fundamental questions can be treated I would like to 
choose the thesis that every mathematical problem can be solved. We are all convinced of 
that. After all, one of the things that attracts us most when we apply ourselves to a 
mathematical problem is precisely that within us we always hear the call: here is the problem, 
search for the solution; you can find it by pure thought, for in mathematics there is no 


ignoramus. 


What was really exciting about Hilbert’s approach (called Hilbert’s Program) was his conviction that 
truth was absolutely knowable. Hilbert maintained that statements were cither true or false and that 
mathematics was, the means by which the truth could be established. To Hilbert, nothing was 
unknowable; there were only unanswered (but answerable) questions. The implication was that problems 
that had been around for decades, centuries, and even millennia could be solved! In fact, the real 
problem was to find and state problems: The proof, be it truc or false, would follow logically. 


Hilbert’s Program begins with the axioms of logic, known theorems, and by successive applications of 
inference rules, any proposition or its negation can be proved. The connection with computing is that we 
might hand over to a computer the task of recalling the rules of logic and correctly applying them, 
Thinking of Babbage’s Analytical Engine, it is casy to see that by building the axioms of logic and rules of 
inference into the machine, we can make it correctly and swiftly begin to produce new theorems! 


Kurt Gödel 


Hilbert’s program was exciting to mathematicians of his day. Imagine how powerful you could be with 
an automatic theorem prover at your disposal. However, in 1931, Kurt Gödel showed that in the 
mathematics that underlics numbers (addition and multiplication), there exist propositions. which are 
both true and unprovable! This was really a direct and complete contradiction of Hilbert’s Program. 
Gödels proof showed that there was a limit to what can be represented and proved logically some truce 
statements simply cannot be proved within the system from which they are drawn. 


Alan Turing 


` 


Alan Turing characterized the computer in abstract terms. The most important of his abstractions, 
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1. Foundations of Computing 


When we think of the modern computer, many of us picture in our minds those amazingly clever little 
devices with high resolution color graphics and voice synthesizers on display in neat little shops among 
the pharmacics and shoe stores at the local shopping mall. Some of us may associate that mental picture 
with wiz kids who design and develop sophisticated software packages to do everything from game 
playing to quantum mechanics. While that mental picture isn’t entirely accurate, it isn't entirely false 
cither. As Seymour Papert points out in Mind-Storms [12], children with little or no training operate and 
program computers as they Iearn to read. According to the best estimates of experts, we are in the middle 
of a technological revolution of computing and robotics that may have a more profound impact than the 


Industrial Revolution. 





= While the computer may scem like a very recent invention, its roots reach back hundreds of years. 
‘Those compact little machines in the shopping mall have resulted from ideas pondered over for centurics 
by some of the finest minds the earth has known. In this chapter, we will trace some of the vital events 
and ideas that formed the foundation for the development of computers and computing as we know them 


today. 


4.1 Automatic Computing 


As recently as the seventeenth century, the ability to perform ordinary ‘arithmetic was not common, 
even among the educated classes. Yet the need for computations was well advanced. Therefore, many of 
the problems that we would find relatively casy to solve today were terribly difficult for our ancestors to 
solve. The more difficult the problems became, the greater the chance for errors. For example, sailors 
could determine their position and effectively navigate over great distances by using a set of tables that 
showed the positions of the sun, moon, and stars on each day of the year. Although the greatest 
mathematicians of that time, such as Newton, Euler, and Laplace, worked on such tables, their 
calculations to determine these locations had to be carried out by hand. Thus many a sailor went to Davy 


Jones’ locker, duc largely to the inaccuracies of human computation. 


omputation involves navigation. The depth 


Another illustration of the need for accurate mathematical c 
know when high tide occurs to safely enter 


of water at a scaport varies with the tides. It is important to 
many ports. Tides are basically functions of the location of the sun and the moon relative to the carth. 


Nevertheless, a relatively complex harmonic analysis is required to determine the water level in a port at 
any specific time. Such computations, when carried out by hand, proved entirely too inaccurate, and far 
too many vessels ran aground. What was both needed and sought was a means of automatic computation. 


Machines which automatically computed mathematical functions date back at least to the Thirty Years 
War [7]. In 1642, at the age of 20, Blaise Pascal constructed a machine that automatically performed 
addition and subtraction. In 1673, Leibniz, co-inventor of the calculus, developed a device known as the 
Leibniz Wheel that extended automatic computation to multiplication and division. A number of similar 
machines were reported during this same period of history. ‘The important point is that some 
mathematical calculations were performed rapidly, accurately, and automatically by machines. 
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A number of other approaches to the representation of effective procedures have led to the same 
conclusion. Among those who have worked in this area are Raphael Robinson, Andrey Markov, Emil 
Post and S.C. Kleene. What is important for us to. understand is that, even with an infinitely fast 
computer with an infinitely large memory, some problems simply. do not admit to a mechanical or logical 
solution. ‘The Halting Problem is but one example. 


1.3 Modern Computing 


By modern standards, ENIAC, the Harvard Mark I, and other carly electronic computers were very 
primitive. “They were also very difficult to think about, program, and kecp running. Onc of the people - 
involved in the design and construction of ENIAC developed a good approach to the first two problems, 
His approach was so good that we still usc it today. 


4.3.1 The Von Neumann Machine 


John von Neumann, an immigrant to America from Hungary, was one of the greatest intellects of the 
twenticth century. He made fundamental contributions to the Manhattan Project (which created the first 
atomic bomb), economic and game theory, the design and construction-of computers, and other arcas. In 
addition to his scholarly abilities, his memory startled those who worked with him; he couid recite, at will, 
page after page of novels. He travelled widely to participate in the numerous projects that were of 
interest to him. His primary base of operation was the Institute For Advanced Study at Princeton 


University. 


consists of the following five parts: 


Arithmetic Unit Recall that during von Neumann’s time, computers were expected to produce results 
that were uscful in military ballistics. Many other highly numcrical uses were 
anticipated. It seemed natural to von Neumann that the computer should have the 
basic operations of arithmetic built in as a primitive hardware function. Today this 
portion of the computer is known as the arithmetic logic unit (ALU). : 


Control Unit ENIAC was a "three ring circus” with many operations going on simultancously ( this 
is referred to as parallcl processing). Perhaps this optimized some aspects of 
computing, but it made the machine a very difficult device to understand. Von 
Neumann decided that it was better to sacrifice speed for the sake of conceptual 
clarity. Therefore, he suggested making the central control operate in a strictly 
sequential fashion. It was to fetch and execute one instruction at a time. This basic 
architecture underlies almost all current digital computers. The fact that the central 
control can only address one word of memory at a time is referred to as the "von 
Neumann bottleneck”. 


Memory From the time of Babbage, it was obvious Ais memory was needed to store both data 
and the programs which operate on that data. Von Neumann provided this needed 
memory in his model of the computer. It should be pointed out that there is no 
inherent difference in the way that a computer stores data and programs. 
Programmers occasionally begin executing data accidentally as if it were a program. 
More sophisticated users actually exploit the uniformity of data and program storage 
in artificial intelligence applications. 


In 1945, von Neumann gave us his logical overview of computers [14]. The von Neumann machine _ R 
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constants. ‘This technique is in no way peculiar to the constant 41 which we chose randomly. In this way, 
the value of any polynomial can be determined from a difference table. ‘This is very important because, 
as Karl Weierstrass proved [2], any continuous (smooth) function over an interval may be approximated 
as closely as desired with a polynomial, Since any polynomial can be determined using a difference table. 
it follows that any continudus function may be computed as accurately as desired using a difference table 
which requires no multiplications. 


The above difference table is of the second order. ‘That is, all entrics in the second difference column, 
(D,), are the same constant. Babbage’s machine supported up to sixth order tables (DoDo) Still, it 
only calculated a very special class of functions; it was therefore not a gencral purpose instrument 


Babbage’s Analytical Engine embodies the concept of the general purpose computer. Simply stated, 
the Analytical Engine is a machine that works on two kinds of information: a sct of data and a sequence 
of operations. Interestingly, the automatic bank tellers that have become popular in the last decade 
operate the same way. The set of data to be operated on are the accounts of the bank customers, and the 
opcrations include, among other things, deposit, withdrawal, and interest calculations. 


Successful people seem to have the ability to learn from and extend the good ideas that are around 
them. It appears that Babbage derived the idea for his general purpose computer from the Jacquard 
loom, which Joseph Maric Jacquard developed at about the time of the French Revolution. Before the 
Jacquard loom, a weaver had to know the pattern to be woven. The weaver would follow a set of 
instructions that explained which thread to use as the fabric was created. Jacquard used cards with holes 
punched in them to encode the desired pattern. The holes allowed only certain threads to be engaged by 
the shuttle on a given pass. This may be a little hard to visualize, but the principle is essentially the same 


as that of a player piano. 


Coming up with the idea for such a general purpose computing machine was a stroke of genius; 
actually implementing such an idea required engineering skill and perseverance of equal rarity. Babbage 
designed a device that used a set of punched cards to describe the variables (data), and another sct of 
cards to describe the operations on those variables. ‘The machine was to inspect an operation card, 
complete the desired activity on the appropriate variables, and then proceed to the next operation. In this 
way, a sequence of instructions (i.e. a program) was to be executed. The technology of the day did not 
facilitate construction of such a machine. and although Babbage’s insight and energy were legendary, he 
died before completing work on this machine. : 


Babbage’s work is known largely because of communications with The Royal Socicty of Science in 
London. Babbage traveled in excellent intellectual circles, counting among his acquaintances Dickens, 
Darwin, Bessel. Fourier, Mill, and Browning. One close friend was Augusta Ada Byron, Countess of 
Lovelace and Baroness Wentworth. Ada was the brilliant daughter of the British poct Lord Byron. She 
not only grasped the idea behind the Babbage Analytical Engine, but also wrote of it more effectively 
than Babbage himself. Her plac? in the history of computing was assured when the U.S. Department of 
Defense named its new and advanced language, Ada, in her honor. 


—————————————— 


lTrade mark of DoD 
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complexity and detail. What was needed was a way for the user (programmer) to communicate with 
ENIAC’s hardware conveniently. ‘Yo solve the problem, they needed a language that people could use 
easily. Von Neumann made the decision to build such a language, and the task of implementing it was 
given to Adele Goldstine. The result was a slower ENIAC, but onc whose uscfulness was greatly 


improved, 


Machines have a definite, limited vocabulary. When instructed in its own language, a machine can 
operate at optimum speed. Unfortunately, machine language is very difficult for people to understand 
and use. ‘Therefore, languages that are casier for people to use have been developed. Indeed, one view of 


the development of programming languages is that it has provided more convenient and straightforward - 


ways to express a sequence of instructions to the machine. These languages are called high-level 
languages. Unfortunately, the computer doesn’t “understand” high-level languages. ‘Therefore, special 
programs have been developed to translate high-level languages into machine-language. ‘These programs, 
called compilers or interpreters, take a program in a high-level language as input and produce an 
equivalent machine-language version as output. Part one of this text is devoted to programming 
methodology in conjunction with one such high-level language, Pascal. In part two, attention is paid to 
the way that computers interact with these sets of instructions. 


1.4 Areas of Computer Science 


Computer Science can be viewed as an academic discipline that is composed of four areas of study: 
computing systems, programming systems, artificial intelligence, and theory. A number of alternative 
taxonomies of the discipline exist, but we choose to develop this one. While the discipline can be divided 
into these areas, there are overlaps among these subfields. 


1.4.1 Computing Systems : 3 ` 


The purpose of computing systems research is to investigate ways to improve the hardware aspects of 
computing. Most of the effort here is spent to make faster and more powerful computers. Research in 
this area focuses on hardware issues such as the design and development of multi-processing machines, 
distributed networks of machines, and the super computers of the future. Much research in this area is 
ongoing to develop techniques for overcoming the limitations imposed by the von Neumann bottleneck. 


1.4.2 Programming Systems 


Research in the area of programming systems focuses on the software aspects of computing. The 
purpose of this research is to develop tools that will make computers easier to understand, use, and 
program. Classic examples of such tools include compilers, editors, and operating systems. More recent 
examples include electronic mail systems, computerized bulletin boards, and document preparation 
systems such as the one uscd to typeset this book. Duc to years of research and development within this 
area, many people who were completely baffled by computers a few years ago now usc them daily. 


1.4.3 Artificial Intelligence 


Artificial Intelligence as a field dates to a mecting at Dartmouth University in 1956. 


Since there is no agreed-upon definition of intelligence, the notion of artificial intelligence cannot be 
clearly defined. However, ‘Turing proposed a test to determine if a computer exhibited intelligence [3]. 
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‘The Harvard-IBM Mark I was the first automatic digital computer. At the time, it was also the largest 
electromechanical calculater ever built. Jt could perform table look up and the four fundamental 
arithmetic operations on numbers up to 23 digits long. Both input and output were performed through 
punched cards. 


Although it took 6 seconds for onc multiplication and nearly 12 seconds for one division, the Mark I's 
flexibility was demonstrated by its solutions to various defense problems, including logistics, ballistics 
tables, and a highly secret mathematical simulation of the first atomic bomb. lts suitability for 
computations insured its continued operation for over 15 years. ‘Though this electromechanical machine 
was soon surpassed by. completely electronic machines, it was’ the first completed and operational 
automatic, general-purpose, digital calculator. It fulfilled Babbage’s dream. 


ENIAC 


The Electronic Numerical Integrator And Computer (ENIAC) was the first completely electronic digital 
computer. The Federal Government funded the development because the military desperately needed 
help to aim its artillery accurately. Accurate long range artillery fire demanded either precise tables or 
some means of computing the ballistics. The technology for hurling projectiles had far outpaced the 
physics required to determine where they would come to earth. For example, Germany had built a 
powerful gun carly in World War I. Much to the Germans’ surprise, the gun’s range was about twice what 
they expected. It turned out that, among other factors, artillery shells were piercing atmosphere that was 
only about one half as dense as that near sea level. The theories that underlay ballistics simply had not 
taken into account such complications. Automatic computation seemed to be a logical approach to 


solving such complex problems, since manual computation was clearly unsuitable. 


The Ballistics Research Laboratory was responsible for the development of ENIAG, which consisted of 
approximately 18,000 vacuum tubes, 1,500 relays, 70.000 resistors, 10,000 capacitors, and 6,000 swiiches. 
It was 100 fect long, 10 feet high, and 3 feet wide. The machine used pairs of vacuum tubes to form the 
basic units of memory. ‘These pairs were called flip-flops because at all times exactly one of the tubes was 
conducting and the other was non-conducting. We say that the flip-flop is in state 1 when the right hand 
tube is conducting and state 0 when the left hand tube is conducting. The machine operated on base 10 
(decimal) arithmetic. : 


Work on ENIAC began in 1942 and was completed in June of 1946. However, ENIAC was extremely 


difficult to program, because it required circuit modifications to change its program. It wasn’t until 1947 
that ENIAC was modified to be a computer in which a program could be stored, like the Babbage 


machine or the Jacquard loom. 


4.2 Theoretical Underpinnings 


Computers have always been designed to use recent advances in the enginecring ficlds. This has 
tended to limit the field of computer science to people with backgrounds in engineering, mathematics, OF 
the physical sciences. There is a more fundamental sense in which mathematics supports computing. It is 
the intellectual heritage of the foundations of computing that we look at now. 


There are practical limits to what can be computed. In the early 1970's, it appeared to experts in the 
ficld that there was a practical limit that would prohibit machines from playing chess better than human 
Grand Masters [11]. Today, there is a program named Bell which runs on special hardware and plays 
chess at the expert level. It was developed by Ken Thompson at Bell Labs. 
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Algorithm Analysis 
Algorithm Analysis ts concerned primarily with the specd of algorithms, but it is also 


concerned with the amount of space algorithms require in computer memory, As will 
be shown later, there are very profound cffects depending on which of several 


algorithms is implemented. 


Numerical Analysis 
It is important to realize th 
machine, but it is used to represent, 
system. The computer does not represent the 
arca studies the nearness of results and determines how to encode v 


information in the best ways. 


at we are dealing with a digital computer. It is a finite 
among other things, the in finity of the real number 
se numbers with complete accuracy. This 
arious types of 


Verification ‘The study of program verification addresses one of the fundamental questions of 
gram, how do we know that it computes what the 


computer science: Given any prt 
+ ‘This arca investigates ways of verifying or proving 


programmer intends it to compute. 
programs to be correct. Some of the elementary techniques will be introduced later in 


this book. - 
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known as the Turing Machine, is a preciscly defined, generalized conception of a computer. It is a 
rigorous formalism representing computation, an abstract machine rather than a physical device. ‘The 
Turing Machine has three fundamental components: a tape, a read/write head, and a state machine (sec 
figure l-1 helow). ‘The tape is infinite in two directions, and it is divided into cells. The read/write head 
may be shifted in cither direction. ‘The state machine contains a finite nuinber of internal states. ‘The 
contents of the current tape cell and the machine's internal state determine whether to write a symbol on 
the tape, what symbol to write, whether to move to the next cell on the tape, and what the next state is. 


Turing showed that given such a mathematical idealization, there is no way to determine if an arbitrary 
program, unce started, will halt. This is known as the Halting Problem and is, in a sense, equivalent to 
Gédel’s result. Turing’s representation now underlies theoretical computer science. Unfortunately, the 
contributions of this genius were cut short by his sudden death at the age of 42. In recognition of the 
importance of Turing’s contribution to the ficld, the Association for Computing Machinery (ACM) 
namcd its prestigious award after him. Annually, the ACM presents the Turing Award to the outstanding 
computer scientist of the year. 


State 
Machine 





Figure 1-1: Turing Machine 


The representation of cornputation that Turing used came more than a decade prior to the first 
electronic digital computer. It appears quite similar to the magnetic tape that is used in computing today 
and provides a convenient illustration for a number of concepts that will be used later in this text. 


“Alonzo Church ` 


Alonzo Church also worked in the area of computability. Using a set of mathematical functions now 
known as the lambda-calculus, he established an equivalence of computing power between the lambda- 
calculus and the Turing Machine. Moreover, he also showed that the procedure for which Hilbert 
scarched could not possibly exist. 


Church attempted to make the Turing Machine more powerful by adding a varicty of enhancements. 
Among other things, he tried an infinite number of tapes, an infinite alphabet of symbols, and random 
access of the read/write head. He found them all to be equivalent to the Turing Machine. It is Church's 
Thesis that the Turing Machine (often referred to as the Universal Turing Machine) is the most powerful 
model of computation and cannot be made more powerful. Since Church, many others have tried 
unsuccessfully to make a more powerful abstract machine. 


File Date: January Ist 1985 at 16:41 ; ; Scribe Date: January 2nd 1985 at 00:36 
File Name: CHAPTER ILMSS.442 line 445 Seribe Version: 4(1-100) 





Many important results in this field, proving the unsolvability of certain ` 


problem classes by computer, save much futile programming effort. 


To formulate the theory, a mathematically precise model of computer ` 


and algorithm for it is necessary. Several such models have been developed, 
of which the best known is the Turing machine discussed in Section B. The 
basic approaches of the theory are presented here with the use of proce- 
dures, which are somewhat informal but close to those of the higher level 
programming languages. 


To establish the capabilities of machines, the theory of computability _ 


analyzes what transformations of inputs into outputs they are able to per- 
form or, in other words, what functions may be computed by them. It is not 
necessary to consider directly all possible functions with various numbers 
and types of arguments, since the universe of discourse may be narrowed 
with the use of an important tool known as Gödel numbering. This signifi- 
cantly simplifies the analysis needed to determine the limitations of auto- 
matic computation. 

Gödel numbering makes it possible to consider, in the development of the 
theory, exclusively functions with single arguments ea are poner? in- 
tegers. 

This device is subsequently aie to prove that even in this limited area of 
consideration there are functions that are not computable; i.e., there are 
problems that cannot be solved with computers. The unsolvability of the im- 

rtant halting problem is further proven. 

The argumentation of the proofs is as important as the results siicrigehaniss 
since it is typical for the theory of computability. 


1. FUNCTIONS AND THEIR COMPUTASILITY 


The theory of computability attempts to establish which functions} can be 


computed by machines. 

A function (called also mapping) is a rule that associates with each ele- 
-ment x of a set D (called the domain of the function) a single element y of a 
set R (called the range of the function). The element x is called the argu- 
ment; and the element y, the value of the function for this argument. These 
notions are illustrated in Fig. 11-1.. , l 
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Input : Von Neumann designed a general purpose machine. A general purpose machine 
operates on many different programs and on various sets of data. In order to get new 
information into the memory of the machine, there must be some sort of input device. 
Thus von Neumann provided input capability in his design. 


Output Output is necessary if the computer is to communicate the results of its computations. 
Von Neumann provided for outpul devices in his model of the computer. 


1.3.2 The Modern Computer 


The von Neumann machine is realized in many ways today. Typical computers have a unit known as 
the arithmetic logic unit, or ALU, which implements arithmetic in hardware. Digital computers represent 
numbers and mathematical operations in various ways: Some optimize floating point arithmetic 
representations; some are built to perform operations on arrays, others are built to minimize cost. In this 
last class of machines, the ALU is a single silicon chip, sometimes integrated with the central control. 


The central control of the von Neumann machine is now referred to as the central processing unit, or 
CPU. The CPU sclects the next instruction from memory, decodes it, fetches needed data, stores results, 
and then selects the next instruction. ‘This process can be implemented in a variety of ways, some of 
which include a single chip. Others are composed of a mixture of hardware and software, while still 
others require the cooperation of many computers and CPUs. 


There are a number of currently popular implementations of memory as well. Most machines 
distinguish between primary and secondary memory. Primary memory is fast but expensive, while 
secondary is slow but cheap. Primary memory has consisted of vacuum tubes, crystals and standing 
waves, ferrite doughnuts strung together in a lattice called core, semiconductors, aswell as some other 
implementations. . Secondary memory has been built using paper tape, punched cards, magnetic drums, 
and magnetic tape. Magnetic disks are very widely used right now, but for applications where 
information.is to be written only once and read often, optical techniques are beginning to show promise. 


Input devices are diverse and numerous. They range from thermocouples and ultrasonic devices to the 
familiar keyboard of a terminal. Fuel injection systems on automobiles have input devices, and the 
indestructible joy-stick on the Pac-Man game is an input device. Paper tapes were once used, programs 
were dialed in, and circuit boards were rewired. In many places, a punched card, much like the original 
Hollerith Punch Card, is still being read by a high-speed card reader. Microphones, which have their 


signal digitized, and cameras used in computer vision systems are also input devices. 


Output devices are probably as numerous and varied as input devices. The printers used with word 
processing systems are widely used output devices. The cathode ray tube (CRT) used in video display 
terminals is another. Less obvious output devices include tape and disk write heads. More exotic output 
devices include oscillators that produce synthetic sounds similar to voices and musical instruments. High 
resolution color graphics terminals are also output devices. 


1.3.3 Programming Languages 


Computer scientists discovered carly that a general purpose computer wasn't too useful if it was 
extremely hard to program. Difficulty in programming was definitely a problem with ENIAC. It took 
experts days to program the slightest changes. ‘The task required the experts to restructure the hardware 
for cach new application. The ENIAC programming manual consisted of a thick volume of staggering 


Kile Date: January Ist 1985 at 16:41 Scribe Date: January 2nd 1985 at 00:36 
File Name: CHAPTER LMSS.442 line 574 Scribe Version: 4(1400) 





Gib 





Wil a SINBICAFRUMICHG WHOSG CShigit APU FONG oe ee 
(also called natural numbers). A Gödel numbering is an encoding that con- 
verts multiple arguments as well as arguments of other types to a single 
nonnegative integer. This encoding may be used also‘to represent the mpoo 
to the procedure that computes the function. 

Several nonnegative integers Xi, Xz) +++» Xn that are inputs to a procedure 
may be uniquely represented by a single integer z as follows: 


z= 2h. 3. aap 


‘where 2, 3,..., p are the n first prime numbers. 
The cane apaa X Xa «+ +) Xn May be recovered from the integer z 
since every integer has a unique decomposition into dhas 


EXAMPLE 11-2 
(a) The three inputs 
3 xX, =4, x, =1, x3=2 
may be encoded as follows: 
23 of 2a 


(b) From a Gédel number z = 140, the following inputs are uniquely re- 
covered: 


x, 2, x80, my i ae r. 
since 140 = 22. 5.7 


Inputs that are strings of any nature (e.g., real numbers or character 
data) may be individually encoded into their Gödel numbers with, for ex- 
ample, the use of their character code representations. 


3. EXISTENCE OF UNCOMPUTABLE FUNCTIONS 


The following argument proves the existence of functions that map nonneg- 
ative integers into nonnegative integers and that are not computable. 

Let us assume that the set of all such functions over nonnegative integers 
is enumerable. This means that each function may be assigned a unique in- 
teger number: 1, 2, 3. 

Let us then call Ea PA EF. a E RT 

We will now construct another function; 


H(k) = Fy(k) +1 for k=1,2,3,. 


This function aiso maps nonnegative integers into Aontas integers. It 
is, however, neither one of the functions of the set Fi, Fa» Fs -- , since for 
any k it is by 1 greater than F(X). 

We have therefore proven by contradiction that the set of all functions 
over nonnegative integers is not enumerable. 

The set of algorithms that may be constructed to compute such functions 
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In the Turing ‘Test, there are three participants: the machine to be tested, a human, and an interrogator. 


‘The object for the interrogator is to distinguish the human from the machine, ‘The object for the machine 


is to foil the interrogator. If the machine can do so, it is considered to exhibit intelligence. A machine 
exhibiting intelligence in this fashion is said to possess artificial intelligence (Al), or machine intelligence 


(as opposed to human intelfigence). 


Artificial Intelligence is well illustrated by the subject matter with which it deals. The following topics 
have received a good deal of attention in Al: 


two-person games There arc now chess playing programs that compete at the expert level. Recently, a 
backgammon program defeated the human world champion in Monte Carlo to win the 
$10,000 prize that was offered. The current world champion Othello player has 
declined to play the reigning world champion Othello program! 


problem solving One of the first areas to which attention was directed was the proof of theorems. The 
idea was not to simply apply brute force, but to use some heuristics (educated guesses) 
in the proof of interesting theorems. Other domains of problem solving include the 

composition of mclodies and the extraction of plots from short stories. 


diagnosis For some years now, medical diagnoses have been made by a program named 
MYCIN. It uses knowledge of bacteriological phenomena and is able to make expert- 
level inferences. Given a sct of symptoms, MYCIN will attempt to determine the 
cause for the illness. The Digital Equipment Corporation has a similar expert program 
to configure its computing systems. 


: perception One goal of computer science is to enable the computer to engage in speech and vision 


- communications. Systems with vocabularies of over 10.000 words which can both hear 
`. sound and synthesize appropriate aural responses exist. Vision systems to interpret 


visual information are useful for a wide range of applications, including monitoring the . 


military actions of unfriendly nations via satellites. 


robotics No discussion of artificial intelligence would be complete without reference to the 
robotics effort. Dangerous, routine, and error prone jobs are being turned over to 
robots. Not only is this area technologically important, but itis socially significant as 
well. Many jobs presently performed by people will be soon taken over by machines. 
However, it isn’t clear what should be done to ensure employment for the workers 


displaced by robotics. 


1.4.4 Theory 


There are a number of areas which make up theoretical computer science. We only list four of them 
here. Each of these four is a heavily rescarched arca. 
Computability Theory 
The Theory of Computation studies the limits of computation, what is computable, 
and models of computation. Some taxonomies would include automata and the theory 
of formal languages here. Connections are, among other things, to the design of 
compilers. 
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We may easily obtain the Gödel encoding of the procedure INVERT; let 
us assume it to be X. | 

Let us submit the Gödel number <X, X> as the input to the procedure 
HALT. The result of the exccution of the procedure HALT should be: 


1 if INVERT (X) halts 
HALT (<X, X>) = pe 
0 if INVERT (X) does not halt 


- However, considering the procedure INVERT, if HALT (<X,X>)=1, 
then the while-do statement constitutes an infinite loop and the procedure 
INVERT does not halt! 

Since we were able to find a procedure INVERT for which the halting 
problem cannot be solved, no general-purpose program that would decide on 
the termination of other programs for an arbitrary input is possible. 

Another important example of an unsolvable problem is the so-called 
equivalence problem, namely, there is no program that can determine for 
two arbitrary other programs whether they are equivalent (i.e., whether 
they compute the same function). Like many other theorems on the theory 
of computability, this is proven by reduction to the halting problem. In oth- 


er words, such proof amounts to establishing that if the given problem were _ 


solvable, the halting problem would also be solvable. . 


5. COMPUTATIONAL COMPLEXITY 
The theory of computability is complemented by the study of computation- 
al complexity, the investigation of run time and memory space required for 
the solution of major classes of problems. This theory has established a 
number of bounds on such resource requirements and thus guides the pro- 
grammer on the design of better algorithms. 

An example of such a result is that the sorting of items requires as a 
minimum on the order of nlog,n comparison operations. 


B. TURING MACHINES 


In order to determine the computability of functions it is necessary to have 
a precise mathematical definition of algorithms. No such precision is possi- 
ble with real computers (refer to Part One for the description of such algo- 
rithms, which is far from precise). i z 


A number of theoretical constructs have been devised in which an exact . 


definition of computation and of an algorithm is possible. The best known 


are the Turing machines; others include Markov algorithms, lambda calcu-~ 


lus, and the theory of recursive functions. 
A Turing machine is the most powerful of automata. An automaton is an 


idealized machine for information processing whose actions are specified in 


D. ; 
ae m Em- 





oe = 


GS 


ad 


| CGOwirPUT ABILITY, 
TURING MACHINES, AND 
FORMAL LANGUAGES 


“~ 


This chapter introduces important theoretical areas of computer science 
with a minimal formalism. | 

The theory of computability outlines the limitations of computers, usually 
by establishing the classes of problems that cannot be solved by machines. 
Actual computers do not lend themselves well to a formal description that is 
to be manipulated by the tools of mathematics. Therefore, precise theoreti- 
cal models whose capabilities are conjectured to be equivalent to those of 
computers have been used in the development of the theory. The models 
most often utilized for this purpose are Turing machines. _ 

The study of language processing by computers led to the theory of for- 
mal languages. One of the practical aspects of this theory is the aid it pro- 
vides in the translation of programming languages. Formal languages are 


sets of strings formed according to strict rules defined by their grammar. 


The theory of formal languages is closely related to the theory of automata 
that analyzes abstract information-processing machines. The most powerful 
of these machines, in terms of their computational capabilities, are again 
the Turing machines. A class of automata exists for each of the four classes 
of formal languages that recognizes the strings of the language by answer- 
ing the question, is this string formed according to this grammar? 


A. THEORY OF COMPUTASBILITY 

The theory of computability explores the limitations of computers by estab- 
lishing what can and what cannot be computed by a machine. In such inves- 
tigations the practical matters of the computation time and memory space 
required are of no concern. 





Quintuples have the following general form (although notational vari- 
ations abound): 


qi» ti, tj d;, qj 
where 


q; is the present state of the machine; 
t, is the tape symbol read by the read/write head; 
t; is the tape symbol to replace the one being read © = 
(possibly, t; = tj, i.e., the read symbol is not replaced); 
d, is the prescribed direction of head movement 
(R for right, L for left, N for no move); 
q; is the state to be assumed next by the machine. 


A quintuple should be interpreted as follows: 


if q; t; then 
tis dis qj- 
else 


\ 


this quintuple does not presently apply 


. 
. «if ‘hea 


The flow of control within the program is accomplished by the machine - 
selecting the quintuple that applies in the current state with the given sym- 
bol under its head. ae 

The states are denoted as qo, q1» - - -> qa and HALT. At least in one quin- 
tuple the next state symbol ought to be HALT to provide for the termina- 
tion of the computation. 

To describe a Turing machine, the following has to be specified: 


= 


the set of tape symbols, called the tape alphabet - 
(it always includes the blank symbol); 

the initial state of the machine; 

the representation of the input data on the tape; 

the initial position of the head; 

the representation of the output(s) on the tape; 

the program itself. 


The following examples present two Turing machines. 


EXAMPLE 11-3 3 ; A 


Problem ; - 
_ Design a Turing machine to increment any natural number (nonnegative 
integer) by 1. om ee ve 
Solution 


> am| tape alphabet: b (blank), 1. 
& 2] initial state: Qo ` te 





(domain) 
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FIGURE 11-1. A function y = f (x), where y is the valuc.of f for the argument x 


A function may be specified as a collection of ordered pairs (x, y) or as a 
procedure for the computation of the value for all arguments. 


EXAMPLE 11-1 
(a) The function 
| f(x) = 2x 
has as its domain and range the set of all real numbers. 
` (b) The function . 3% 


oes oe : f(x) = # 
has as its domain the set of all real numbers and as its range the set 
consisting of 0 and positive numbers. : 


A function is computable if and only if there exists an algorithm 
(see Chapter 1) that for any x belonging to the set D computes the value of 
y =f (x). If a function is computable, a program that computes any such 
value may be implemented. 

Related to the computability problem are the problems of enumerability 
and decidability, both of which may be reduced to computability. A set D of 
elements with a given property is enumerable if and only if there exists an 
algorithm that either determines that the set is empty or enumerates (lists) 
‘all the members of the set. A set D is decidable if and only if there exists an 
algorithm that can determine whether a given element belongs to the set or 
not. In other words, the problems consist in enumerating all the elements 

with a particular property ‘(e.g., all even numbers) or deciding whether a 


given item has the given property. 


s 





Notes _ : ae 

1. Note that the answer is K = 3, which indeed equals the original 2 + 
is Speer i 

2. Check that the program works for special cases, for example, 0 + 0. 

3. Which decisions in the writing of this program were necessary and 
which arbitrary? : 
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While each computation requires a different Turing machine, a universal 
Turing machine exists that is able to simulate any other Turing machine. 
Thus, the tape of a universal Turing machine fulfills Church’s thesis by be- ` 
ing able to compute any computable function. 

A given problem is solvable, that is, there exists an algorithm for its solu- 
tion, if and only if, when this algorithm is presented as a program for a Tur- 
ing machine, the machine halts. There is, of course, as shown above, no gen- 
eral algorithm that would establish this for an arbitrary program: the 


halting problem is undecidable. 


©. FORMAL LANGUAGES 


Languages are characterized by their syntax and semantics. Language syn- 
tax, which lends itself to a formal definition, permits the classification of. 
the language as belonging to one of the four categories of the so-called 
Chomsky hierarchy of formal languages. The richness of natural languages 
surpasses, however, the possibility of such formal definition, while higher 
level programming languages can be approximated by the so-called context- 
free grammars of the hierarchy. A compiler for the language so defined 
may use this definition to parse programs. 


4. SYNTAX AND SEMANTICS : 

Both natural and programming languages may be described by their two as- 
pects: syntax and semantics. Syntax is the grammar of the language: it de- 
fines which strings of characters are valid constructs in the language. The 
syntax of a programming language determines which of the programs that 
are written in it are valid. Semantics defines the meaning of the language 
constructs. Even though a program may be syntactically correct, if its 
meaning is other than that intended by the programmer, the result of its ex- 
ecution will usually not be a solution to the problem. 


EXAMPLE 11-5 


In a natural language, English, the following two sentences have identi- . 
cal syntactic structure but certainly different meaning: A DOG BIT A 


MAN vs. A MAN BIT A DOG. 





Lab 


is, however, enumerable. For example, each such algorithm (or procedure), 


treated as a string of symbols, may be assigned its unique Gödel number, 


which proves the point. i 
We conclude, therefore, that there are more functions over nonnegative 


„integers than there are algorithms available to-compute them. This proves 
_ the existence of functions that cannot be computed. 


4. THE HALTING PROBLEM AS AN EXAMPLE 
OF AN UNSOLVABLE PROBLEM 


The theory of computability has established many negative results, that is, 


has identified functions that cannot be computed or, in other words, prob- 


lems that cannot be solved by machines. 
A most important unsolvable problem (i.e., proven not to have automatic 


solution) is the halting problem. 


The definition of the halting problem is contained in the following theo- 
rem: there does not exist an algorithm A that can decide for an arbitrary 
program P and input I whether the program P will stop when presented 
with this input. ; 

The usefulness of such an (alas, impossible) algorithm in’ computing 
would be obvious. = 

The following argument proves the unsolvability of the halting problem. 
To represent algorithms, procedures resembling functions of programming 
languages are used; i.e., a value is assigned to the name of the procedure 


and is treated as a variable name in this respect. 


Let us assume that we indeed have a procedure HA LT that is the solution 
to the halting problem. The procedure, in pseudocode, looks as follows:T 


Procedure HALT (<P, I>) © 
‘begin 
statements that determine whether P(I) halts; 
if P(1) halts then 

HALT<1 

; else . ; 
HALT—0 

- end 


` 


If such a procedure exists, we can also define the following procedure: 


Procedure INVERT (Z) 
begin ; 
while HALT (<Z; Z>) = 1 do 
INVERT]; * OR ANOTHER ASSIGNMENT TO INVERT * 


INVERT—0; * ALSO ANY ASSIGNMENT TO INVERT WOULD DO * 


end 





symbol | is used here instead of the word “where” (and not “or’!) and the 
limitations on zn and n following it specify that at least one x and one y are 
to appear in any valid string. 


For example, according to the above specification, the following strings 


belong to the language L,: 
) xxx + yy, x + y, X+ yyyyy A 
while the following do not: 
xy (+ has to separate x and y); 
x+ (n = 0) 


A fruitful alternative way to specify a language is by presenting its gram- 
mar. The grammar is built around the rules for generating all the strings of 
the language. These strings are obtained (generated) by a consecutive appli- 
cation of these rewriting rules (called also productions). 

A graminar of a language consists of: 


terminal symbols: the symbols that may appear in the strings of the lan- 
guage (i.e., belong to its alphabet); 


nonterminal symbols: used only in the intermediate constructs during the 


eneration of a string by the application of productions; 


the sentence symbol: a selected nonterminal symbol, used to start the gen- - - 


eration of all of the strings of the language; 
productions (rewriting rules) of the general form: 


string-1 — string-2 


‘which means that the left-hand string may be replaced by the right- 
hand string. 


Generation of a string of a language defined by the given grammar pro- 
ceeds as follows: 


(1) A production whose left-hand side is the sentence symbol (usually 
denoted S) is used first. Such a production (or several such produc- 
tions) is always part of the grammar. 

(2) A production may be applied to an intermediate string if the left- 
hand side of the production appears in this string. Such productions 
are applied, and thus a sequence of intermediate strings is obtained. 

(3) The generation of the string is completed when no production can be 

applied to it. If this string contains only terminal symbols of the lan- 
guage, this is one of its strings. A s 


Therefore, a string of a language is generated through a sequence of in- 
termediate strings, called the derivation of this string. To connect the 
EE strings used in the derivation, the double arrow (») symbol is used. 





e a 


n 


See 


i : i j z9 ba | 





mathematical terms. What has become known as Turing machines was in- 
troduced by the British logician Alan Turing in 1936 and thus preceded the 
development of general-purpose electronic computers. A conjecture (as such 
it cannot be proved, but strong evidence exists in its favor), known as 
Church's thesis claims that a function is computable if and only if it can be 
computed by a Turing machine or a construct of an equivalent power. Thus, 
the power of these, as it would appear, “simple” machines is equivalent to 


` the power of computers. 


A Turing machine is an automaton that consists of a control unit, a tape, 
and a read/write head (see Fig. 11-2) together with the program designed 


for it. 


tape 






a square read/write head 


control unit 


FIGURE 11-2. A Turing machine 


The control unit of a Turing machine holds the information concerning 
the current state of the machine. A Turing machine can be in one of the fin- 


-. ite number of states, determined by the algorithm, called a Turing machine 


program. A state reflects the past history of the computation due to the giv- 
en program. The program is assumed to be “built into” the control unit. 
The tape of a Turing machine serves as its memory and input/output de- 


vice. It consists of an infinite number of squares, each containing a single - 


symbol. Only a finite number of squares may contain the information need- 
ed by any computation; the rest of the squares are assumed to contain a spe- 
cial symbol,-called a blank. pacer. 

The machine is able to read and write information from a single square at 
a time, namely, the square pointed to by the read/write head. The read/ 
write head moves from one square to the adjacent one as directed by the 
program and transmits the information between the tape and the control 
unit. The tape initially contains the input presented to the machine, is sub- 
sequently used as the working storage, and ultimately provides the output. 
- The algorithm to be followed by a Turing machine is called its program. 
This is a set of instructions (“rules”) of a very restricted type. The machine 
carries out an instruction as a single step. Every instruction has the form of 
a quintuple; i.e., it consists of five ordered elements. Each quintuple speci- 
fies the action to be taken if the machine is in a given State, with the read/ 
write head pointing to a given symbol. The action may consist of changing 


he symbol, moving one square left or right, changing the state, or a combi- 


| 


A production whose left-hand side string is contained in its right-hand ` 
side string is recursive: the substitution is “defined” partially in terms of it- 
self. The second production in the Example 11-7 is of this kind. 


3. CLASSIFICATION CF FORMAL LANGUAGES 
A classification of formal languages based on the nature of the productions 
of their grammar was introduced by Noam Chomsky. in 1959. 

This classification established the hierarchy of formal languages, with. 
these placed lower constituting a subset of the higher ones. Starting from 
the top, the following types of languages are distinguished: 


(1) unrestricted languages, with any type of productions included in their 
: grammar; even these fail to reflect the complexity of natural lan-. 

guages, although they most closely approximate them; 

(2) context-sensitive languages, a model that encompasses programming 
languages but is rarely used due to the difficulty of manipulation; 

(3) context-free languages, the most important category, since it closely 
approximates higher level programming languages and is rather easi- 
iy manipulated; these are further discussed below; : 

(4) regular languages, as a very restricted type of formal languages. 


aran, 


This classification of languages applies also to their grammar. 

Formal languages are related to automata (see Chapter 11-B) in that an. 
automaton (a theoretical machine) of a particular complexity is needed to 
establish whether a string is a member of the language with a given gram- 
mar. Automata, analyzed from this point of view, are called acceptors for 
the strings of a language. Turing machines, as the most complex of auto- 
mata, recognize the strings of unrestricted languages. | ; 


4. CONTEXT-FREE GRAMMARS AND 
THEIR ROLE iN COMPILERS 
Context-free grammars are particularly important in the translation of 
higher level programming languages. Together with some additional rules, 
they may form the basis for the parser section of a compiler (see Chapter 
6-B-2). i 
The productions of a context-free grammar are restricted to the form: 


a single nonterminal symbol — a nonempty string 


This means that in a derivation, symbols are never deleted from the pre- 
viously generated intermediate string. The substitution of strings is per- 
QO formed without regard to the strings that surround it. - 
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input and output representation: an integer M is represented by M + 1 
consecutive 1’s (this is a typical representation, since it represents 0 as 
a single 1, to avoid having totally blank tape). 

initial head position: anywhere to the left of the number. 


The program Comments 
Go. b, b, R, Go - keep moving right while b under head 
Gy 1,1,L,q, ` the number is reached 


qn b, 1, N, HALT in state q, the head is certainly pointing to b 


Notes 

1. Observe how the “transfer of control” from the first to the second 
quintuple occurs. . 

_ Note that when q; = q we may have a loop; in’ our case, the first 
quintuple is such a construct. = 

_ Note the necessity of having at least two states. 

_ How would the program be modified, if the head were to start at the 


leftmost 1? 


Aaw N 


EXAMPLE 11-4 
Probiom 
Design a Turing machine to add two natural numbers. 
Solution 
tape alphabet: b, 1 
initial state: qo : 
input representation: integers N and M äre represented by N + | and M 
+ 1 consecutive 1’s respectively; the two integers are separated by a 
single b. . poet 
initial head position: at the leftmost 1 of the leftmost number 
output representation: integer K, represented by K + 1 consecutive I's. 
Program . . . : 
initial head position: 
... bbI1IbI1b... 
T i ; : 
head and tape after every step: 


Gor 1, 1, Ry Qo. ...bb111b11b ... move to reach separating b 


q, b, ERA __ bblI1111b... replace separating b by 1 
T 
q1,1,R,q, | «+. BBITITIID... 


ay reach the rightmost | ’ 
“qu b, b, L, a: == DOFO ` 


1. 
qs 1,6, L, qs _. bbIJ111bb... delete the rightmost 1 


T 
q» l, b, N, HALT... bbl1illbbb...delete the second rightmost 1 


ae o 





In a compiler the reverse process, that of identifying a program construct 
as valid, is important. During the syntactic analysis phase, the parse tree 
(form of a derivation) is produced as shown in Fig. 11-3. 

A syntax-driven parser recognizes a string (e.g., a program or a particu- ( 
lar statement) as a member of the language, i.e., a syntactically valid con- ' 
struct. 


integer 
unsigned integer ` 
unsigned integer digit 


digit 


o- ee ee 3 
FIGURE 11-3. A parse tree 
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ae syntax of artificial languages, SY" MAENE all ambiguity is 
stricted in comparison to natural languages, since all ambiguity is to be 
avoided. | : 

To make computer processing of languages (programming languages to- 
‘day and, maybe, natural languages in the future) possible, a formal theory 
of languages has been constructed. The primary concern of these math- 

~ ematical linguistics is language syntax, since no formal means of completely 


specifying semantics exists. ` | 
9, DESCRIPTION AND DERIVATION OF FORMAL LANGUAGES 


A formal language, called below simply a language, is a set of strings 
formed by the concatenation (i.¢., writing next to one another) of a finite 
number of symbols of the language in accordance with the rules specified 


for this language. 
The set of all symbo 
alphabet of the language. 
According to these definitions, 


nea{ety lye} 


is that may appear in the strings of a language is the 


the following is considered a language: 


over the alphabet nae Pa ae 


In the above notation: 


L, is the name of the language; : ; 
braces: { and } „contain the list of the strings of the language; 


the symbol | separates the individual strings and has the meaning of 
“or”, 
Thus this language Lı consists of two strings only: x + y and y + x. 
` An important distinction exists between the symbols of the language be- 
ing described (in this case, the symbols x, y, +), and the symbols used to de- 
scribe it ( { } , | ). The latter set of symbols does not belong to the alpha- 
bet of the described language, it is said to belong to a metalanguage used 
for the description of other languages. E 
Such description specifies the rules according to which the strings of the 
language are formed. In order to be able to describe infinite languages (with 
an infinite number of strings, each of finite length) or, in general, languages 
with many strings, this notation has to be extended. = 
In the metalanguage used to specify the following language: 


a es mei n=} 
fy that an arbitrary number of x and y 


of the language. Since m may differ from 
differ from the number of y’s. The 


the power symbols. and n signi 
symbols may appear in the strings 
o n, the numbers of x’s in a string may 
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LOAD J (copy J into the accumulator) A, 
ADD K (add K to the accumulator) 
STORE I (copy the accumulator into I) 


= pox | 


Different kinds of computers have different machine 
languages. Some computers have many accumulators and some have 
faw. Some computers have many instructions and some have taw. 
We will introduce common features of machine languages by , 
inventing a very simple computer. Ws will call our computer VS, 


for very simple computer. 
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The machine instructions for the VS computer are designed to 
be convenient for representing programs written in a subset of 
Pascal. The VS computer has never been built; it is just a 
hypothetical machine that we will use to illustrate points about 
computer languages. 


The instructions for the VS computer have the form 
operator operand 

for example, 
STORE I 


The operator of an instruction tells the computer what to do; the 
eperand tells the computer what to do it to. 


After the computer executes one instruction, it continues to 
the next, unless the executed instruction directs the computer to 
jump to another instruction or to skip an instruction. We can 
translate the Pascal statements 


IF I<#K THEN 
:=I+J3 
I:J} 
into the VS computer instructions 


LOAD K (copy K into the accumulator) 


SKIPLE I (if I<=accumulator, skip next instruction) 
JUMP L (jump to instruction labeled L) 
LOAD: I (copy I into the accumulator) © 
ADD J (add J to the accumulator) 
STORE K (copy the accumulator into K) 
L:LOAD J (copy J into the accumulator) ; 
STORE I (copy the accumulator into I) 


In this example, L is the label of an instruction; instructions 
are labeled so they can be jumped to. In full Pascal, but not in 
PS/k, there are statement labels and there is a GOTO statement 
that is analogous to the JUMP machine instruction. In full 
Pascal, the following statements are equivalent to the example we 
just gava: 


ch 
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EXAMPLE 11-6 


Probiom 
Let us determine what lan 
terminal symbols: x, y, t 


guage is defined by the following grammar. 


|- nonterminal symbols: 5, T 


the sentence symbol: S 
productions: 


Sx 
Sx + T 
y tee SS ge: 


Solution ee 
Either of the productions whose left-hand’side is S may be applied ini- 


tially. 
The first of these results in the derivation 


S2X 
The second leads to the derivation 


Saxt Talt rty 
Therefore, the language is 


L={x|x+x+y)} 


Sot 


EXAMPLE 11-7 


Problem 2 . : 
In the grammar of the Example 11-6, let us replace the productions by 


the following: f 
SX 
S—S + y 
Let us determine the language so defined. 


Solution 
The first production “generates the string x. If the eases production is 


applied initially, we obtain: 
Sag kar Sty ty StI tus sys 


Then, by applying the first production we gbina any string of the follow- 


-ing form: 
xtyty Hit) ; 
This infinite language may be described as: — . , 
£ 


L= { x2" [| zZ=+y. m=0} 
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of a computer. We have left out a whole set of skip 
instructions, such as SKIPGT (skip when greater than). We left 
out instructions for doing REAL arithmetic and for reading from 
data cards. Wo left out instructions for manipulating character 
strings, indexing arrays, and calling and returning from 
procedures. These additional instructions ars important in a 
real computer; if you like, you can design a "super" VS computer 
that includes them. - 


TRANSLATION OF A PASCAL PROGRAM 


If we use some care in picking our example, we can translate 
an entire Pascal program inte VS instructions. This example 
Pascal program requires only the types of instructions available 
on the VS computor: 


High-Lavel Language ; Low-Level Language 


PROGRAM T (INPUT,OUTPUT) 5 
VAR I: INTEGER} 


BEGIN : 
WRITELN(' POWERS OF 2’); PUTSTRING TITLE 
2313 LOAD ' ONE 
STORE . ae 
WHILE I<=#8 DO L1:LOAD ` EIGH 
BEGIN- SKIPLE T 
: JUMP L2 
WRITELN(I)s -7 LOAD : 
PUTINTEGER 
I:eI+l i LOAD I 
ADD I 
- STORE I 
END JUMP Li 
END. L2:HALT 


The first VS instruction in. this example has as its operand 
TITLE; TITLE gives. the location of the string ' POWERS OF 2'. 
Similarly, ONE and EIGHT give the locations of the values 1 and 
8. 


MNEMONIC NAMES AND MACHINE LANGUAGE 


Up to this point we have written VS instructions using names 
such as LOAD, STORE, I and J. These names are not present in the 
machine language that a computer executes; they are replaced by 
numbers. We will now show how these names can be translated into 
appropriate numbers. 

As you may recall from Chapter 2, the main memory of the 
computer consists of a sequence of words. The words of memory 
are numbered; the number that corresponds to a particular word is 
callod the location or address of tho word. Words can be used to 
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portant step in the development of computer science, 


defined by a context-free grammar presented in the so- 
form (or BNF notation). Since this metalanguage comes close to the full 


EXAMPLE 11-8 z 
The following are the productions of a context-free grammar: 


S xr 
Y — xZv 
Ze, 


Å. 


ar 
on constituted an im- ` 
is almost completely 
called Backus-Naur 


The programming language ALGOL, whose definiti 


specification of ALGOL, this programming language comes close to being 
context-free. >> 

In BNF notation, for economy of writing, several productions are often 
merged into one with the use of the symbol (|) that plays here the role of the 


connective “or.” : i 
The general form of BNF productions is 


<N>:= string-1 | string-2 | ..- | string-N 
where 


N is a nonterminal symbol; 
string-I is a string, i.e., a concatenation of symbols, some of which may be 


terminal and some nonterminal; 
angle brackets, < and > , serve to delimit tne names of the language 


components (c.g., <expression>, Or <logical constant>}), i.e., nonter- 
minal symbols. 
Such a production presents, therefore, alternative strings that may re- 
place the symbol N. 


Consider, for example, the following productions of the BNF notation: 


<dist>=0 elfo | 4:15 Ee (1) 
<unsigned integer>:= <digit> | <unsigned integer> | <digit> (2) 
<integer>:= <unsigned integer> | +<unsigned integer> | 

—<unsigned integer> (3) 
= The number —73 may be derived using these productions as follows (ap- 
plicable productions are shown above the arrows): ` 


: (3) ; (2) ? i 
<integer> => —<unsigned integer > => —<unsigned integer > 
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Assombly Language Machine Language 
LOAD J 1 60 
ADD XK ; 3 61 
STORE I 2 59 


As you can 5286, there is a simple translation from assembly 
language to machine language. writing programs in machine 
language is even more inconvenient than writing programs in 
assembly language. People almost always prefer assembly language 
over machine language; they use a program called an assembler to 
translate mnemonic names in assembly language programs to 
corresponding numeric operators and operands. Although we do not 
show it here, assemblers allow the programmer to rossrve and 
initialize memory for variables and constants. For example, 
location 59 would be reserved for I, and location 98 could be 
reserved for EIGHT and initialized to 8. 


STORING MACHINE INSTRUCTIONS IN WORDS 


The values of variables of a program are stored in words of 
the computer's memory. In a similar manner, the instructions of 
the program are stored in words of memory. We can use two words 
to hold each VS instruction; one word for the- operator ‘and one 
word for the operand. Here we show three instructions stored’ in 


locations 18 through 23: 


We could have saved space if the VS computer allowed us to pack 
the operator and operand into a single word. For example, the 
instruction > 


i 59 
could bə packed into a single word as 
1059 


with the convention that the rightmost three digits are the 


operand and the other digits are the operator. Instructions for 


real computers are packed into words to save space, but to keep 
things simple, the VS computer uses two words for its 
instructions. z 


A JUMP instruction has as its operand the labol of an 
instruction. When a JUMP instruction is written in machine 
language, the label must be a number. The number used is the 
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ASSEMBLY LANGUAGES 
AND MACHINE LANGUAGE 


In this book we have presented programming in terms of the 
fesca: language. Pascal is a high-level language; it provides us 
ith a convenient means for directing a computer to do work. The 
omputer cannot execute Pascal programs directly; it can only 
frec». programs in machine language, 4a low-level language. 
efore a Pascal program can be executed by a computer, the 
rogram must be translated or compiled to machine language. In 
f: chapter we will explain how a computer carries. out 
nstructions. We will present features of machine languages and 
their associated assembly languages. i 


MACHINE INSTRUCTIONS 


In Chapter 2 wa gave a brief introduction of machine 
anguage. We explained that the instructions a computer can 
Gecuts are much more basic than Pascal statements. These 
machine instructions use a special location, called the 

arithmetic or making assignments. For 


; ccumulator, when doing 
feis, the assignment of J to I, written as the Pascal 
tatement a ; 
y I:a; i 


could be translated to the instructions 


fi LOAD J (copy J into the accumulator) 
à  S8TORE I (copy the accumulator into I) 


Y another example, the Pascal statement 
I:=J+K; 7 


t. be translated into the three instructions 








STORAGE. OF PROGRAM IN COMPUTER 


(PUT_STRING TITLE) 
(LOAD ONE) 
(STORE I) 
(LI: LOAD EIGHT) 
(SKIP_LE I) 
(JUMP L2) 
(LOAD I) 
(PUT_INTEGER) 
(LOAD I) 
(ADD I) 
(STORE I) 
(JUMP LI) 

- (L2:HALT) 


(59 CORRESPONDS TOT) 


(Si CORRESPONDS TO ONE) 





(98 CORRESPONDS TO EIGHT) 


SPECIAL STRING MEMORY 


I| ‘POWERS OF 2' 
Ps SESE Rie ee 


(= $4 


ACCUMULATOR ee 


# 


| 
| | 


IF I?K THEN 
GOTO 23; 
K:sI+J;3 
23:I:"J; 


GOTO statements were purposely left out of ps/k because careless 
use of them leads to unreadable programs. One of the reasons 
that low-level languages are inconvenient to use is that they do 
not directly provide looping constructs, such as WHILE...DO, and 
selection constructs, such as IF...THEN...ELSE. The programmer 
must build up these constructs using instructions like jumps and 
skips. When an PS/k program is translated into a low-level 
language, the loop and selection constructs appear as jumps and 


skips. 


INSTRUCTIONS FOR A VERY SIMPLE COMPUTER 


The VS computer has an instruction to print the value in the 
accumulator: 


PUTINTEGER 


This instruction needs no operand because the accumulator's value 
is always printed. There is an instruction to print messages: 


PUTSTRING operand 


The .operand represents a string to be printed. There is an 
instruction that directs the machins to stop executing a program: 


HALT 
The HALT instruction has no operand. 

Altogether the VS computer has nine instructions; most real 
computers have many more instructions, typically around i00. 


This table lists the VS instructions. 


Operator Opeorand Action by Computer 


+ LOAD variable Assign variable to accumulator. 

2 STORE variable Assign accumulator to variable. 

a2 ADD variable Add variable to accumulator. | 

4 SUBTRACT variable Subtract variable from accumulator. 

5° JUMP label Jump to labeled instruction. 

6 SKIPLE variable If variable<#accumulator then skip 
next instruction. 

7 PUTINTEGER (none) Print the integer in the accumulator. 

8 PUTSTRING string Print the string. 

9 HALT (none) Halt, the program is finished. 


We have purposely kept the VS computer simple by leaving out 
instructions that might normally be part of the instruction set 


e? 





Similarly, the simulator can carry out the other VS instructions. 
After each instruction is carried out, tho INSTRUCTIONPOINTER is 
incremented by 2 and OPERATOR and OPERAND are set for the next 
instruction. When the instruction is 4 JUMP or SKIP, then 
INSTRUCTIONPOINTER can bo modified so an instruction other than 
the next sequential instruction will be selected. For oxamplo, 
if the OPERATOR is 6, for SKIPLE, the simulator executes this: 


TE MEMORY [ OPERAND ]<#ACCUMULATOR THEN 
INSTRUCTIONPOINTER: *INSTRUCTIONPOINTER+2) 


we will use named constants for 6ach of the VS instructions: 


‘ 


CONST LOAD=1}3 
STORE™2} 


HALT#=9}3 


Thasa declarations should ba global to the simulator procedure. 
and wa will uso 


CONST FIRSTINSTRUCTION=03 


to specify that execution begins with the instruction in words 
zero and one. i 


Now wa give the complete simulator as a Pascal procedure. 
This procedurs assumes. that the MEMORY and STRING arrays have 
been declared and initialized. i 
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represent variables. For example, the variables I, J and K could 
be represented by the words with locations 59, 60 and 61. Here 
we show these three words after I, J and K have been assigned the 


values 9, 0 and 14, 
“(ee 
ol 2S 
Te 


There is no special significance to 59, 60 and 61. We could just 
as well represent I, J and K by locations 42, 3 and 87; the 
important thing is to remember which location corresponds to 
which variable. 


If I, J and K correspond to location 59, 60 and 61, we can 
write the instructions 


LOAD J 
ADD K 
STORE I 

as 
LOAD 60 (copy contents of word 60 into accumulator) 
ADD 61 (add contents of word. 61 to accumulator) 
STORE 59 (copy accumulator into word 59) ` 


The VS instruction operators, LOAD, STORE and so on, ars 
numbered. LOAD is operator number 1,. STORE is 2, ADD is 3 and so 
on. The names LOAD, STORZ and ADD as used in the VS instructions 
are mnemonic names; a mnemonic name is an "easy-to-remember" 
nams. We can choose the names of the operands so that they too 
are easy to remember. 


Using the numbers of the operators we can write 


LOAD 60 
ADD 61 
STORE 5$ 
as i ss 
1 60 
3 61 
2 59 


Instructions that consist only of numbers are in machine 


lJanguaga. Instructions that contain mnemonic names, such as LOAD 
~and I, are in assembly language. - ; 





business may buy a new computer to replace an old computer. 


Aftor the old computer is removed, programs written for the old 


computer can ba executed by 4 simulator running on the new 


machine. 


Sometimes 4 hypothetical computer is designed to help solve 
some particular problem. This is the case with several Pascal 


compilers especially those that run on microcomputers and 


minicomputers. A hypothetical computer is designed to allow easy 
mputer's 


translation from Pascal programs to the hypothetical co 
machine language. The translated Pascal programs are executed 
using a simulator for the hypothetical machine. Other compilers, 
such as the Pascal 6000 compiler, translate programs into the 
real computer's machine languages; then ‘a simulator is not 
required beacause the translated program is executed directly by 


the computer. 


CHAPTER 21 SUMMARY | 


In this chapter we have presented features of machine 
language in terms of a. very simple hypothetical computer called 
VS. The VS computer has an accumulator that is used for doing 
calculations. There are VS machins instructions for loading, 
storing, adding tO, subtracting from, and printing the 
accumulator. There is 4a machine instruction for printing 
strings. There are instructions for jumping to instructions, 
skipping instructions and for halting. The nine VS machine 
instructions were sufficient for the translation of the example 
Pascal program given in this chapter. Real computers typically 
have many more instructions. The following important terms were 


discussed in this chapter: 


word ~ the computer's main memory is divided into words. 
Bach word can contain a number. In real computers, 4 
word can contain - goveral characters, for example, 4 


characters. 


Location (or address) - the number that locates a particular 
word in the computer's main memory. 


Operators and operands ~ most VS machine instructions, such 
as, 


LOAD I 


consist of an operator and an operand; these are LOAD 
and I in this example. some instructions have an 
operator but no operand. 


Mnemonic name -~ a nams that helps programmers remember 
something. For example, STORE is the mnemonic namo for 


VS machine instruction number 2. 
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location of the instruction 'being jumped to. Here is a 
translation of assembly language into machine languages; the label 


.L bocomes 48: 


* 


JUMP L 40 [ - 6 | i 
= ame 


Pon EE 


= Just as tho variablos and instructions are stored in words in 
memory, strings such as ' POWERS OF 2' are stored in momory. In 
real computers this is done by packing goveral characters into 
each word. Since mixing charactors and numbers is confusing, We 
will assumo that the VS computor has a separate part of its 
momory uged only for strings. Each string is saved in a 
different location in the special string memory. If the string 
' powERS OF 2' is in location number 1 in the special string 
memory, then we translate the Pascal statement 


LN 


WRITELN(.' POWERS OF a iF 


to tha machine instruction 


8 1 (PUTSTRING TITLE) 


shown how to translate all VS instructions into 


nto machine language. We will xroturn to QUT 
of 2 and will translate it to machine 


Wa have now 
numbars and thus i 
program that prints powsrs 
language. 


A COMPLETE MACHINE LANGUAGE PROGRAM 


a VS computer starts by executing the 


Wo will assume that 
we will place. gur machine 


instruction in words 0 and 1. So 
language instructions in words 0, 1, 2, 3. «+: Wo will continue 
assuming that variabla I corrosponds to memory location 59. The 


ants 1 and 8 will bo represented by memory locations 
91 and 98; those locations are initialized to hold the values 1 
and 8 beforo the program is oxccuted. We show the program as it 
would appear in momory aftor having executed instructions in 
locations 0 through 14. Up to this point the program has printed 


intoger const 


POWERS OF 2 
1 


The VS computer has an instruction pointor, prosontly set to 16, 


that locatos the poxt instruction to ba axocuted. 
“dnstruction has no operand, wè give it a dummy oporand of zero}r 


for oxamplo, HALT bocomos 9 0. 


When an. 
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TEST BANK 


Chapier 1 


Computers and Programming with 
Pascal 


MULTIPLE CHOICE 


et 


1-2. 


1-3. 


1-4. 


1-6. 


The actual tasks that computer systems can perform are quite limited and can be divided 
into three categories. They are operations. 

a. addition, subtraction, and multiplication 

b. addition, subtraction, and division 

c. arithmetic, storage, and division 

d. arithmetic, comparison, and storage and retrieval 


The three basic components of a computer are z ; 

a. input devices, output devices, and secondary storage devices 

b. a central processing unit, a primary storage unit, and peripheral devices 

c. a central processing unit, secondary storage devices, and peripheral devices 
d. input devices, output devices, and peripheral devices 


Which of the following is the "brain" of the computer? 
a. the peripheral devices c. the input devices + 
b. the central processing unit ` d. the primary storage unit 


Which of the following is a peripheral device? 
. primary storage unit 


a. the central processing unit c 

b. main memory d. an input or output device 
Input devices are used to 

a. enter data into the computer c. store data 

b. process the data d. print data 


Output devices are used to 

a. enter data into the computer 

b. allow the user to examine the processing results 
c. calculate formulas 

d. process the data 
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SIMULATING A COMPUTER 


` 


A VS computer has never been built and undoubtedly never will 
be built. It might seem that we can never have a vs machine 
language program executed. But we can, by making an existing 
computer simulate a ys computer. This is done by writing a 
progran, called a simulator, that acts as if it is a VS computer. 
We will discuss later in more detail the importance of simulators 
in computing, but first we will develop a Pascal procedure that 
is a simulator for the VS computer. 


The VS computer has an accumulator, which can be simulated by 
a variable declared by 


VAR ACCUMULATOR: INTEGER; 


It also has a memory containing 100 words, whose addresses are 0 
to 99. This can be simulated by an array: 


CONST MEMORYSIZE=993 — 
VAR MEMORY: ARRAY[0..MEMORYSIZE] OF INTEGER; 


There is a special string memory. Assuming that the VS computer 
can hold, at most, 10 strings of length at most 80, we can 
simulate the string momory by another array: 


CONST STRINGSIZE=80; 

© .NUMBEROFSTRINGS® 10; : 

VAR STRING: ARRAY Í 1. .NUMBEROFSTRINGS ] OF = 
PACKED ARRAY[1..STRINGSIZE] OF CHAR; 


We noed an ‘instruction pointer to keep track of which instruction 
is to ho executed next. 


VAR INSTRUCTIONPOINTER: INTEGER; 


When the VS computer is executing, the instruction pointer 
has a particular value, say 10, indicating that word 10 contains 
the operator of the next instruction to be executed. Word 11 
contains the operand. If OPERATOR and OPERAND are declared as 
INTEGER variables in the simulator, then they should be given 


values by: 


OPERATOR: «MEMORY [ INSTRUCTIONPOINTER] ; 
OPERAND : eNEMORY [ INSTRUCTIONPOINTER+ 115 


If the OPERATOR is 1, meaning LOAD, the simulator carries out the 
LOAD machine instruction by executing: 


ACCUMULATOR: “MEMORY [ OPERAND] ; 


Tf. “the OPERATOR is 2, meaning STORE, the simulator carries out 
the STORE instruction by executing: 


MEMORY [OPERAND] : "ACCUMULATOR; 





1-17. The loop would probably be used = 
a. to execute a sequence of statements once 
b. ina program that determines the larger of two numbers 
c. ina program that adds two numbers together and determines the average 
d. ina program that finds the sum of two hundred numbers 


1-18. Ina program that calculates the sum and product of three numbers, it is best to use the- 


a. loop ; ; . c. simple sequence 
b. decision structure d. comparison structure 
1-19. Subprograms are ; 


a. incomplete programs 

b. parts of a program, each designed to perform a specific task 
c. unstructured programs that tend to contain errors 

d. programs that are written in machine language 


1-20. Pascal is é 
a. a block-structured language c. arelatively old programming language 
b. a low-level language d. named after its designer 


1-21. TURBO Pascal is : 
a. another name for Standard Pascal 
b. implemented on microcomputers 
c. a low-level language 
d. not popular with programmers because it is expensive and slow 


1-22. . Which of the following statements about data is true? ; 
a. Itis facts that have been collected but not organized in any meaningful way. - 
b. Itis the result obtained after an input has been processed. 
c. Itis a set of instructions that a computer uses to solve a problem. 


d. Itis also referred to as output. 


. 1-23. Which of the foliowing statements about information is tue? 
a. Itis facts that have been collected but not organized in any meaningful way. 
b. Itis the result obtained after an input has been processed. 
c. Itis a set of instructions that a computer uses to solve a problem. 
d. It is entered into the computer to be processed. 


1-24. Before a program written in a high-level language can be executed, it must be translated 
into machine language. The program to be translated is referred to as the 
a. object program c. subprogram 
b. source program d. application program 


1-25. What is a characteristic of structured program languages? 
a. They are usually low-level languages. 
b. They make it easier to break down a program into modules. 
c. They are more difficult to use. 
d. They can be executed directly by the computer. 
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ad (# THIS PROCEDURE SIMULATES A VERY SIMPLE COMPUTER *) 


PROCEDURE SIMULATOR; ; 
[| VAR ACCUMULATOR, INSTRUCTIONPOINTER ' 

OPERATOR, OPERAND: INTEGER} 
BEGIN 


INSTRUCTIONPOINTER: 
OPERATOR : -MEMORY | INSTRUCTIONPOINTER) | 


OPERAND: “MEMORY | INSTRUCTIONPOINTER+ 111 
WHILE OPERATORCHALT DO 
BEGIN 
CASE OPERATOR OF 
LOAD: 
ACCUMULATOR: "MEMORY [ OPERAND] ; 


STORE: 
MEMORY [ OPERAND ] : ACCUMULATOR: 
3 ADD: 
ACCUMULATOR: ® 
SUBTRACT: 
ACCUMULATOR: 


JUMP: 
INSTRUCTIONPOINTER: SOPERAND~ 2] 


SKIPLE: 
IF MEMORY [ OPERAND ] <#ACCUMULATOR THEN 


 NSTRUCTIONPOINTER? *INSTRUCTIONPOINZERS2; 
PUTINTEGER: 
WRITELN (ACCUMULATOR) ; 


PUTSTRING: - 
WRITELN (STRING [ OPERAND] ) 


=uPIRSTINSTRUCTION; 


ACCUMULATOR+MEMORY [ OPERAND |; 


eACCUMULATOR-MEMORY [ OPERAND ] ; 


(* 2 ADDED BELOW *) 


INSTRUCTIONPOINT 
LINSTRUCTIONPOINTER] ; 


OPERATOR: MEMORY 
O ERAND : “HEMORY | INSTRUCTIONPOINTER® 1] 


END 
END} 
rf you want to run a VS machine language program, you can 
procedure to put the numbers representing the 
constants into the MEMORY array, initialize the 
SIMULATOR procedures. 


write a main 
program and 
STRING array and then call the 


USES OF SIMULATORS 


We will now discuss some of the uses of simulators. Our 
simulator for the VS computer can be used to exocuto VS machine 
it can serve another purpose, too... BY 


language programs. put 
determines the actions 


reading the SIMULATOR procedure, you can 
h vs instructions if you did not know how 4 vs 
id find out by studying its simulator. 


carried out for eac 
t ons computer to act like 


computer worked, you cou 
go not only can the simulator direc 
another, it can also show how & computer works. 

programs written 


rs aro often usod to allow 
example, 4 


Computer simulato 


to execute on another machine. For 


for one machine 


[ 5 END; 
; BR: eINSTRUCTIONPOINTER+2 5 
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 gimulator - a. program that sim 


, 


purely numeric language that is 
directly executed by a particular type of computer. 
Some computer manufacturers sell families of computers, 
of various sizes and speeds, that. all use the same 


machine language. 


Machine language -~ the 


Assembly language -~ programs in assembly language use 


mnemonic names corresponding to the numeric operators of 
machine language. They also permit programmers to 


choose mnemonic names for the operands and labels. 


Assembler ~ a program that translates programs written in 
assembly language to machine language. 


Label ~ a nane that gives the location of a machins 
instruction or a statement. The JUMP machine 
dinstruction, as written in assembly language, transfers 
statement, 


control to a labeled instruction. The GOTO 
ag written in full Pascal, but not in ps/k, transfers 


control to 4 labeled statement. 


ulates some system such as a 


computer. A simulator treats a sequence of numbers as a 


machine language 
operations. 


program and carries out the specified 








1-7. 


1-9. 


1-10. 


1-12. 
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Secondary storage is used to 

a. enter data into the computer 

b. allow the user to examine the processing results 
c. store data 

d. process the data 


Anexample of an input device is a 


a. monitor screen as floppy diskette 

b. printer d. keyboard 

Which of the following is NOT computer hardware? 

a. aprogram c. amain memory ~~ 
b. a central processing unit d. peripheral devices 

A program is 


a. a sequence of instructions that work together to allow the computer to solve a 
specific problem . 

b. the input to a given problem 

c. alist of system commands 

d. computer hardware 


All of the following are types of computers EXCEPT Po 
a. supercomputer c. macrocompu 
b. microcomputer d. mainframe 


Which of the following statements about software is false? 

a. Itis a program or a series of programs. 

b. Itcan be divided into categories: application programs and system programs. 
c. Software is not the physical part of the computer system. 

á. Itis only used with microcomputers. : 


. * Which of the following is a type of programming language? 


a. assembly — ¢. symbolic 
b. interpreter d. compiler 


Which of the following statements about a high-level language is true? 

a. Itis the only language that the computer can directly execute. 

b. It uses ones and zeros to specify various machine operations. 

c. Itis strongly oriented toward the programmer rather than the computer. 

d. It requires the programmer to understand the technical details of internal computer 


operations. 


Which of the following statements about a low-level language is true? 

a. Itis the only language that the computer can directly execute. 

b. Ituses symbolic names to specify various machine operations. 

c. Itis strongly oriented toward the programmer rather than the computer. 
d. Itis another name for machine language. 


The three basic structures that can be used to solve programming problems are 
a. the simple sequence, the decision structure, and the loop 

b. the simple sequence, the decision structure, and the comparison structure 
c. the loop, the comparison structure, and the simple sequence 

d. the loop, the decision structure, and the comparison structure 
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1-26. 


1-27. 


1-28. 


1-29. 


Which system program translates each program statement into machine language and 
then executes it before going on to the next statement? 


a. assembler 7 c. compiler 
b. translator d. interpreter 


Which of the following is false? 

a. Interpreters often use more computer time than compilers. 

b. TURBO Pascal programs are translated into machine language by a compiler. 
c. Acompiler translates and executes each program statement one at a time. 

d. . Using an interpreter saves space in the computer's memory. 


Programs are translated and executed and the results retumed to the monitor screen in 


seconds with processing. 


a. continuous c. quick 


b. interactive d. batch 


Several user programs are grouped and processed one after another in a continuous 


stream with processing. 
a. batch c. interactive 


b. continuous 


d. quick 
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SAMPLE EXAM #1... 


Multiple Choice Questions® 
eee ae 
Directions: For each of the following choose the best Tos 


Use the following diagram to answer questions 1—4. 


program A: 


procedure B: 


l procedure D: | 


È begin 


procedure E; 


begin 





7 

‘The Advanced Placement Exam will be divided into two sections. The first is an hour and 
one half multiple choice test of 45 questions. The second is an hour and one half essay 
exam of five questions. 


, 
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(D) writeln (i,Cube(i)); 
(E) none of the above 


37. The decision to choose a particular sorting algorithm should be made based 
on 5 


I. order of magnitude of the sort (O(n), O{nlogn}, etc.) 
Il. size of the file 
Ill. original sequence of the data 


(A) I 

(B) I ; ° 
(C) I : 

D) 1 & M 

(E) all of the above 


38. The following program will output 


program Names (input, output); 
var Sum : integer; 
procedure Add.; 
var Sum : integer; ` 
begin 
Sun-= 5 F Spr: 
writeln (Sum); i $ 
end; it - 
begin (* main *) 
Sum: = iti =. : gales 
writeln (Sum); 
Add; 
writeln (Sum) 
end. 


(A) 2, 10, 2 
(B) 2,2 
(C) 5, 2,2 
(D) 10, 2, 2 
(E) 10, 10 


39. The tree below represents the postfix expression 


| 
| 
| 
[| | 
| 
| 
fi | 
| 
| 
| 
i 
| 
| 
| 
[i | 
| 
| 
(| | 
(| | 
i | 
F . 
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Sample Examinations 


40. The minimum number of fields in a record of type Persdata is 


41.. 


42. 


43. 


p-aa 


6: Convert := Jun; 
7: Convert := Jul; 
8: Convert := Aug; 
9: Convert := Sep; 
10: Convert := Oct; 
11 : Convert := Nov; 
12 : Convert := Dec; 
end; (* CASE OF MONTH *} 
end; (" FUNCTION CONVERT *) 


(A) 5 
(B) 12 
(C) 7 


{D) 14 


(E) 8 


Which of the following will cause the assignment of the proper value to the ` 


Month field of the employees birthdate? 

(A) zead|Mnth]; 
convert|Mnth); -> 

(B) readIn|Employee. Birth. Month), 

(C} readIn|Employee.Birth); ` 

(D} readIn{|Mnth), - 
Employee.Birth.Month := Math, 

(E) ReadIn{OrdMonth); 


Employee.Birth.Month := Convert(OrdMonth); 


The outcome of a call to Convert of the form 
Convert(13); 


will produce 


{A} no selection made and execution continues normally following the end. 


(B) Jan is selected 

(C) nil is assigned to the Month 

(DY Month is undefined 

(E) blanks are assigned to the Month 


Given the following procedure heading 


procedure GetStuff var Phone : Telephone; 
var Name : Str32; 
Month : MonthAbrev; 
OK : boolean); 


Which callis legal? aie = 


(A) GetStuff (Employee . Hometel, 
Employee . Name, 
Jan, 
true); 


Dery +: 
= 


Er 
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SAMPLE EXAM #2 ` 


Multiple Choice Questions’ 





Directions: For each of the following make the best choice. (Note: Some questions in 
this test only have 4 choices.) : 


1. In standard Pascal, the operator with the highest precedent is 
(A) not 
(B) div 
(C) and 
. (D) or 
[E] in 


2. For which real number will the results of trunc and round differ? 
(A) 3.2 
(B) 0.0 
(C) 7.0 
(D) -4.2 
(E) 4.8 


3. What is the value of 3 + 10 mod 3? 
(A) 1° 
(B) 6 
(C) 4 
(D) 6.33333 
{E} 2 3 


4. What is the value of the following expression? 
Se 10/ 21 


l 

li 

a on ; | 7 : i 
eee a og . | | 

| i} 

i 

fi 


’The multiple choice questions in sample A.P. Exam #2 were contributed by the Washing- 
ton Area A.P. Computer Science Support Group. The exam may be longer than the standard 
exam. It contains a number of time-consuming questions and students should not be con- 
cerned if they do not complete it in one and one half hours. 


yer 
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10. 


11. Consider the following syntax diagrams 


The following sequence 


read (Xct, Pvol, Obs) ; 
would result in the following values being stored in the variables 


(A) Xct = 31, Pvol = —2806.9, Obs = 3 
(B) Xct = 31, Pvol = 47.08, Obs = 77 
(C) Xct = 31, Pvol = 47.08, Obs = 6 

(D) Xct = 31, Pvol = —2806.9, Obs = 77 
Suppose one wishes to create an array with 100 rows and 10 columns. Each 
cell must store a sing 
plish this creation? 


I var Table : array{1 . . 100, 10 . . 1] of char; 
Il. var Matrix : array[0..99, —4.. 5] of char; 
Il. var Chart : array{200 . .299,.‘A’.. ‘J'| of char; 


(A) I and II only 
(B) I and Ill only 
(C) II and Ill only 
(D) I, II, and Ill 


Which of the following can be an index type of an array'in Pascal? 


i, --1..50 
Il. boolean 
Ti -Oe2 4 


(A) I and II only 


. (B) I and III only 


(C} I and III only 
(D) II only 


le character. Which of these declarations will accom- - 



















































Multiple Choice Questions EE 


(AL A+B-C*D. 
AB ASCH D* + 
(Cc) A+BC-D* 
ID; ABCD =+ 
(EF) +A-BC*D 


The following definitions and declarations apply to questions 40-43. 


type 
MonthAbrev = (Jan, Feb, Mar, Apr, May, Jun, Jul, Aug, Sep, Oct, Nov, 
Dec); 
Str32 = packed array(1 . . 32] of char; 
Address = record 
Street1, Street2, 
City, State, 
Zip : Str32; 
end; (* ADDRESS RECORD `} 
Date =record 
Day ::F = ..31; 
Month : MonthAbrev: 
Year : 1800. . 2100; 
end; (DATE RECORD‘) 
Telephone =record 
: Area : integer 
Number : Str32; 
end; (* TELEPHONE RECORD * ) 
Persdata = record 
Name : Str32; 
HomeAdd : Address; 
Birth: Date; - 
HomeTel : Telephone; 
case Married : boolean of 
true : (NumDepend : integer; 
SpouseName : Str32); 


false :(); 
: end; 
end; {* PERSDATA RECORD ‘). 


var 

Employee : Persdata; 
Ordmonth, Mnth : integer; 
Response : char; 


Function Convert (Month : integer) : Sect Abeev 
begin 
case Month of 

: Convert := Jan; 

: Convert := Feb; 

: Convert := Mar; 
: Convert := Apr; 
: Convert := May; 
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begin (* block2 *) 


end; 
begin (* block1 *) 
end; 


begin (* main program *) - 


end 


13. A variable that is declared in procedure Block3 and only in procedure Block3 
is accessible in 
(A) all of program A 
(B) both procedures Block2 and Block3 
[C] both procedures Block1 and Block3 
(D) procedures Block1, Block2, and Block3 : 
(E) procedure Block3, but not procedures Block! and Block2 es 


Questions 14 and’15 are based ona program with the following structure. 


. 


program Scope (input, output); 
var f | 
A, B: real; : | 
procedure X; 
var . . 

A, B: real; — i | 
procedure Y; i | 
var 

A, B : real; I 
begin (* Y *) . 


end; (* Y *) 
begin (* X *) 


- end; (* X *) 
begin (* SCOPE *) 


end. (* SCOPE *) 








‘eh 


=. C a Bere kee] -l 9 - omi = 
. 


PP omar spe rt OT x 
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44. 


ie 


Multiple Choice Questions 


(B) GetStuff (301 555-1212, John Smith, Jan, 
ae true); 
(C) GetStuff (Area, Number, Name, Month, 
Married); 
(D) GetStuff (Telephone, Name, Month, true); 
(E} GetStuff (HomeTel, Name, 12, true); 


Arithmetic logic units 


I. store data 
Il. perform arithmetic operation 


` I. perform comparisons 


IV. communicate with input devices 


(A) II only 
(B) HI only 
(C) H&H 
{D} I, II and IV 
(E) I, I and II 


Translating a Pascal program into a machine language program is generally 
the job of 

(A) arithmetic logic unit 

(B) the compiler : 

(C) primary storage unit 

{D) filer 

(E) none of the above 
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19. 


20. 


pean e 
readin (C, X, Y;); 


(* statement including invocation(s} of Zmake *) 


writeln (Z); 
end. 


Given the input —3 0 4, and the statement Z := Zmake(C, Octo, X) + Zmake 
(Y, C, Zmake|X, Y, C)}, the writeln statement will produce the value 

(A) -192 

(B) —156 

(C) 156 

(D) 192 


Show what is printed by the following program - 


program Test6; 

const 
Offset = 20; 

var 3 
Tl, Te, T3 : integer; 


procedure Test (var P1, P2, T1 : integer); 
begin 

readIn |P1, P2, T1); 

T1.:= PiP + TI 
end; 


begin 
Test(T1, T2, T3); ° 
T2 := T2 + Offset; 
writeln (‘THE ANSWERS ARE’, T2, T3, T1, Offset) 


- end. 
DATA 105 12 
The possible answers are 
(A) 10 15 5 20 
(B) 15 7020 
(C) 25 17 10 20 
(D) 35 27 20 20 
type e 
Vowel = {A,E,1,0,U); 
var 
Letter : Vowel; 
Ch : char; 





Multiple Choice Questions 


5. Which of the following is a valid identifier in Pascal? 


(A) 3box 
(B) R2d2 
(C) $good2be 
(D) Get Smart 


function Physmark (Veloc: real; n1, n2 : integer)’: real; 


In an attempt to identify a Pascal subprogram, which of the following state- 


ments is a true statement concerning this subprogram identification? 
(A) A function requiring four parameters, two of integer and two of real. 


(B) A procedure requiring two parameters, one of real and one of integer. 
(C) A function requiring three parameters, one of real and two of integer. 


(D) A function requiring one parameter of type real. 


Assume the following declarations 


const 
MaxDev = 20; 
Refr =-8.5; 
type 
Indic = 1..MaxDev; 
vat 
C11, C12, C13 : array [Indic] of real; 
Txr, Swb, Yval : real; 
Num, Nval, Amt : integer; 
U, V, G: Indic; 


Using the above declarations, write a Pascal expression for: 


Num 
Amt + 4 


(A) Txr := Num/Amt + 4 
(B) Nval := Num/(Amt + 4) 
(C) U:= Num/(Amt + 4) 
(D) Swb := Num/(Amt + 4) 


Assume the following declarations 


var 
Trwt, Pvol, Roof : real; 
Xct, Obs, Nmax, Side : integer; 
Ltr, Sym, Vwl : char; 


and the following four lines of input 


31 bbbb47.08bb6bb—9bbb85. 1 
~2806.9bbb77bbbRWB 
3] -42 K A Col 
5 508 4M7 
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Use the following information to answer question 25. 


Suppose that L is a list of distinct numbers arranged in increasing order and 
that a binary search is to be used to determine whether some number x occurs in 
this list. 


25. If L contains 100 entries and x actually occurs in the 56th entry of L, how 
many entries from L will be examined by the search? 
(A) 3 
(B) 4 
(C) 7 
(D) 56 
{E} 100 


26. Consider the following function definition: 
2+ (XK - 3, Y - 1) ifX > YandX>0 
f(X,Y) = 1 + £(Y,X), if Y >=X and X > 0 
; if X <=0 

Find £(1,2) 

(A) 0 
(B) 2 
(C) 1 
DOLS- 
(E) 4 


27. Consider the following sequence of procedure calls where Add pops 2 hum- 
- bers from the stack and pushes their sum, and Mult pops two numbers and 
pushes their product. t - : 


Push X fi 

Push Y j 

Add f 

Push X 

Mult i $ ; 

Push Y l í f 

Add ; 
If X = 2 and Y = 3, find the value the stack contains. | 
(A) 12 
(B) 15 
(C) 13 i 
(D) 25 
(E) 10 fi 


28.. Algebraic formulas can be represented in trees. Reading inorder the given tree 
represents which algebraic formula? : 


` 





12. 


Multiple Choice Questions 


punctuation: letter: 





Which of the following is invalid? 


(A) AT’S 
(B) A’TS! 
(C) ? 
(D) SAT. 
(E) S’! 
The following fragment is designed to compute the number M of distinct 
elements among Afl], ... ,A[N] and place these M distinct elements into 
Al ... AM]: : 
M:=1 


for J ag to N do 
if A[/] <> A[M] then 


begin 
M:=M+ l; 
A[M] := All] 
end; 


‘What is the maximum number of times the fragment compares elements 


from the array A? 
(A) N 

(B) N+ 1 

(C) M 

(D) M+ 1 

ERN =S= 


Use the following program to answer question 13. 


program A; 
var a, b : integer; 
procedure Block1; 
var a1, b2 : integer; 
procedure Block2; 
var a, a2, b2 : integer; 
procedure Block3; 
var a3, b3 : integer; 
begin 


end; 
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31. 


32. 


(B) 1, 5, 5, 1, 5,0 
(C) 1, 5, 4, 1, 5, 0 
DAS Ro 
(E) 1,5, 43,0 


A postorder traversal of the tree below would cause the nodes to be visited in 
which of the following orders? 


(AJ MPFAXZK 
B) PMAFZXK 
(Cc) PMAFXZK 
DI MPFAZXK 


If ‘Vwd’ and ‘Wstr’ are both declared as a packed array [1 . . 14] of char, the 


sequence ; 
Vwd := ‘LARGE AND BLUE’; 
Wstr := Vwd 


would result in Wstr having the value 


`- (A) EMPTY 


33. 


34. 


(B) LARGE AND BLUE 
(C) not possible to define 
(D) L 


The list 25, 43, 18, 75, 9,55, 32 is to be sorted into ascending order using 
heapsort. The numbers are stored in an array A[1 .. 7]. During the sorting of 
the entire array using heapsort, which of the four numbers 9, 25, 32, 55 will 
be the first to land in its final position? 

(A) 9 

(B) 25 


(C) 32 


(D) 55 


Using a normal bubble sort structure, how many passes through the array 
would it require to sort an original array 7 2 4 9 6 3 to the array 97643 2? 
(The sort will terminate when the array is sorted and will not need to com: 
plete all possible passes.) : 

(A) 5 - 
(B) 3 
(C) 4 
(D) 2 


Q- 


i 
l 
l 
i 
A 
i 
i 
i 
I 
[ 
1 
f 
i 
j 
I 
[ 
l 
A 
i 





Multiple Choice Questions 


14. If variable A is being assigned a value in procedure Y, this will change the 
value of variable A in ; 
(A) the main program (* SCOPE `) only 
(B) procedure Y only 
(C) procedure X and Y 
(D) all procedures and the main program 
(E) procedure Y and the main program only 


15. If we want to access a variable C only in procedure Y, we should declare C in 
the variable section of 
(A) procedure Y only 
(B) procedure Y and X only 
(C) the main program (* SCOPE *) and procedure Y only 
(D) the main program (* SCOPE si 
(E) procedure X, procedure Y, and the main program (* SCOPE *) 


16. Given a 12-element, one-dimensional integer array named Numbers and an 
integer variable named j and the following statements, 


Numbers{1] := 8; 
forj:=2tol2do . - 
Numbers |j] := (j — 1) * Numbers |j — 1} div (j — 1); 


show the resulting contents of numbers. 
(A) all 8s 

(B) 8, 16, 24, 32, etc. 

(C) 8,0, 8, 0, 8, 0, 8, 0 

(D) 8, 64, 132, 264, etc. 


17. The more efficient structure to compute the letter grades A, B, C, D, F, given 
a set of numerical scores and cutoff values of 93, 85, 75, and 65, is 
(A) A multiple decision statement. 
(B) A case statement. ` 
(C) A while-do loop. 
(D) A for-next statement. 


18. Consider the following program skeleton 


program Clip] (input, output); 

const : 
Octo = 8; 

var z 

G A a integer; 

function Zmake (Al, A2, A3 : integer) : integer; 


var 
Pr : integer; 
begin 
Pr := sq(A2 — A3); 
Zmake := Al * Pr 
end; 
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Use the following declarations for questions 40 and 41. 


40. 


41. 


42. 


type 

AgeType = 0.. 125; 
ClassType = (Freshman, Sophomore, Junior, Senior); 
WordType = packed array |1 . . 15] of char; 
Location = record ; 

Number : Integer; 

Street : WordType; 

City : WordType; 

State : WordType; 

Zipcode : WordType 

end; ; 2 
Student = record 

Name : WordType; 

Age : AgeType; 

Class : ClassType; 

Gpa : real; è 

case Housing: boolean of 

true :{DormName : WordType; 
RoomNumber : integer); 
false : (Address : Location) 
end; 
var 

Pupil : Student; 


Which of the following is a valid assignment statement if the student does 
not live in the housing? 

(A) Student . Name := ‘Sue Smith’; 

(B) Pupil. Number := 42; 

(C) Pupil . Housing . Address . Number := 42; 

(D) Pupil . Address . Number := 42; 

(E) none of the above 


Which of the following is a valid statement? 
(A) writeln(Student . Gpa); 

(B) writeln(Pupil . Name); 

(C) writeln(Pupil . Class); 

(D). writeln|Pupil . Housing . RoomNumber); 
(E) none of the above 


Given these declarations, 


type RecPtr = “DataRec; 
DataRec = record 
x: DataType; 
Lnk : ReċPtr 
end; 
var b : RecPtr; 





21, 


22. 


23. 


24. 


Multiple Choice Questions 


Using the above type and variable declarations, which of the following is the 
most appropriate way to assign a value to the variable Letter? 
(A) read(Ch); 


Letter := Ch; , 
(B) read(Ch); ‘ 
if Ch = ‘C’ then Letter := Vowel; ‘ 


(C) read{|Letter); 
(D) read(Ch); 
if Ch = ‘A’ then Letter := A; 
{E} read(Ch); | 
if Ch in [Vowel] then Letter := Ch; 


The list 25, 43, 18, 75, 9, 55, 32 is to be sorted into ascending order using 
heapsort. What is the depth of the tree after the largest element has been 
placed in its final position? (Count the root as level zero.) 

(A) 1 

(B) 2 

(C) 3 

(D) cannot be determined 


A list of integers is read in, one at a time, and a binary search tree is con- 
structed. Next the tree is traversed and the integers are printed. Which traver- 
sal would result in a printout which duplicates the original order of the list of 
integers? 

(A) preorder i i 

(B) inorder : 

[C] postorder 

(D) none of these | 


IfA = 1, B= 3, C = 2, and D = 4, what is the value of the postfix expression 
AB-GD2S 


(A) -8 
(B) 8 
(C) 4 
(D): =4 


Write the following infix expression in postfix (RPN) form. 
A+B-C*D 


(A) ABCD + =" 
(RB) ARLD* + = f 
IC CDA = 
D AB+CD* - 
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l. Consider the following sequence of procedure calls: 


Push(s)+ 

Push(y); 

Add; 

Push(z); 

Push(w); 

Mult; > ž A PAATE ' z 
Add; 


Invoking Push causes its argument to be pushed onto a stack. 
Invoking the procedures Add or Mult causes (1) the stack to be 
popped twice, (2) the two popped items to be added or multiplied, 
and (3) the result to be pushed onto the stack. If x = 10, y = 20; 
z = 30, and w = 40, then at the end of the sequence of procedure 
calls above, the stack contains 


(A) nothing (B) 0 (Cc) 940 (D) 1230 (E) 1410 


Suppose that items Xl, X2, X3, X4, and X5 are pushed, in that order, 
onto an initially empty stack S, that S is then popped four times, 
and that as each Xi is popped off S, it is inserted into an initially 
empty queue. If one Xi is then deleted from the queue, what is the 
next item that will be deleted from the queue? : 


(a) xr (B) X2 (C) X6 (D) X4 (E) X5 


A retail merchant wishes to keep an inventory of items identified 

by six-digit identification numbers. The quantity of each item 
ranges from 0 to 1000 and must be updated each time that* item is 
bought or sold. New identification numbers are created, and old 
ones discontinued, on a regular basis. How might one best maintain 


. the inventory if one wishes to economize on both storage and 


processing time? 


(A) Use a single list A, letting Aln] be the quantity of item n. 

(B) Use a single list A, letting aln] be the quantity of item 
2*n-1 plus 1000 times the quantity of item 2*n. 

(C) Use two lists A and I, placing identification numbers in: <n 
the order in which they are encountered and letting Aln] be the 
quantity of item Tin}. : 

(D) Use two lists A and I, placing identification numbers in I in 
the first available location beyond that indicated by a hashing 
function, and letting A[n] be the quantity of item Hrne 


Which of the following would be the most likely reason for building 
file backup procedures into a computer system offering services 

to many different users? 

(A) To serve the public's right to know 

(B) To enable supervisors to monitor system use 

(C) To improve processing speed 

(D) To facilitate the interchange of information 

(E) To allow recovery from system and/or user errors e-3 


ca 
g 
| 


TT TT 


29. 


30. 
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(A} a — b * c/d + e/b 
(B) a — b * (c/d + e/f) 
(C) a — b * (d/c + b/e) 
(D) a — b * c/d + e/b 


Starting with the element A, the items A, B, C, D, and E are pushed upon a 
stack. The stack is popped four times and each element is inserted into a 
queue. Then two items are deleted from the queue and pushed back on the 
stack. Then one item is popped from the stack. What is it? 

(A) A ` 

(B) B 

CEE 

DD: 

(E) E 


Consider the following sequence of procedure calls 


Push{a); 
Push(b); 
Pop\c); 
a =a +37; 
Push{a); 
Push{c); 
Push\{b); 
b := a; 
Push(b); 
while not Empty do 
begin 
Pop(a); 
write(a) 
end; 


Invoking Push causes its argument to be pushed onto a stack. Invoking Pop 
pops the stack and returns that value to its argument. Empty is a Boolean 
_ which is true when the stack is empty. If initially the stack is empty, a = 0, 
b=5, and c = 4, then the sequence of numbers in the order of output is: 


(A) 5, 5, 5, 1, 5, 0 





is 


12. 


13; 


14. 


15. 





“Var a,temp:integer; 


-3- 


If maxint is large enough to compute m what would it be. 


Function z(x:integer):integer; 





Begin : 1 
i£ (x=1) or (x=3) 
then z:=x 
else 2:= x * -2(x=1) | 
end 
Begin 
m: +- af {23% 26a a | 
end. i 


Which are the first four digits of the octal representation of .7164? p 
(A) .5566 (B) .6655 (Cc) “35656 (D). .6565 


Which of the following gives the number of binary digits required to fi 
represent an integer N. 


(A) trunc(log,N) (B) log5N (C) N tH) T trunc{10g,N) 


what will the following program print if the input line is 


ae 
Program ugh(input,output); 


Procedure switch (var x,y:integer); 
Var temp: integer; 


end; 
Begin 
readln(temp,a); 
if temp <a 
then switch (temp,a); š 
writeln(temp,a) 
end. 


$ 


. 


. 


(A) 25.5 (B) 5 3 (C) 3 3 (D) 5 5 _ (E) none of these 


What will the following program print if the input line is below? 
0.65 


Program Sue(input, output); 
Const size= 4; 
Var x:real; 
i,j:integer; 
Begin 
readln(x); 
write’ <* t 
for i:= 1 to size do 
begin 
n:= trunc(8*x); 
write(n:1); i 
x:=8ž*x-n. ot 
end ; 4 
end; 


(A) .514 (8} 255432 (C) .51463 (D) .5146 (E) none of these. 


35. 
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Suppose. the following list was subjected to a bubble sort: 
53942 


The items are to be sorted from the smallest to the largest. The after one pass 
through the list, how does the list appear? 

(A) 35429 

(B) 53249 

(C) 23459 

(D) 53429 


Use the following information to answer questions 36—39. 


Suppose that an insertion sort is to be used to put each of the following sets into 
increasing order. at 


36. 


37. 


38. 


39. 


52314 
13524 
43215 
23415 
21453 


mOOM> 


Which list will need the fewest changes of position of individual elements? 
(A) A 
(B) B 
(C) C 


. (D) D 


(E) E 


How many movements (changes in position) will be needed in the least case? 
(A) 1 
(B) 2 
(C) 3 
(D) 4 
EES 


Which list will need the greatest number of changes in position? 
(A) A | 

(B) B 

(C) C 

(D) D 

({E}-£ 


How many movements will be needed in the worst case? 
(A) 6 
(B) 7 
(C) 8 
(D) 9 
{E) 10 








ee 
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Suppose that one of many tasks a large program must perform is to 
display information about particular items in a table. Of the 
following, which indicates the best design of a procedure P that 
performs this task upon receiving a specification for an item? 


(A) P searches the table for the specified item and then 
displays the information about that item, 


{By 2 searches the table for the specified item and then 
calls a procedure D to display the information about 
that item. 


(C) P calls a procedure S to search the table for the 


specified item and then itself displays the information 


about that iten. 
(D) P displays the whole table. 
(E) P calls a procedure S to search the table for the 


specified item and then calls a procedure D to display 
the information about that item. : 





43. 


Explanatory Answers 
if a linked list is constructed of records of type DataRec, which of the follow- 
ing pairs of statements must be placed beteween begin and end in this pro- 
gram fragment to find the sum of every other (first, third, etc) data item? 


Sum := 0 
while b<>nil do (* b points to the first record in the list *) 


(A) Sum := Sum + b’.x; 


b.:=-bob".ink 
(B) Sum := Sum + b*.Lnk’.x; 
b := DLAK 
(C) Sum := Sum + b*.b*.Lnk; 
b := b’.Lnk s 
(D) Sum := Sum + b^.x; 
:= b*.Lnk*.Lnk 


(E) Sum := Sum + b^.Lnk 
b := b*.Lnk*.Lnk 


If Ptr is a pointer, what built-in Pascal routine creates a variable of the type 
referenced by the pointer Ptr? - 

(A) Get(Ptr) 

(B) Ptr := nil 

(C) New{Ptr) 


-  (D) Dispose(Ptr) : ; i 


For questions 44 and 45 consider the following declarations: 


44. 


45. 


* type pointer = “Node 
Node = record 


Info : real; 
Next : Pointer 
end; 


var P1, P2 : Pointer; 


The call new{P1) 
(A) creates the pointer P1 
(B) replaces the old pointer P1 with a new pointer P1 


‘(C) creates the variable P1* and puts its address into P1 


(D) initializes the variable P1 to zero 
(E) returns P1* to the available nodes so that it can be reused 


Which of the following is valid? 


(A) P1 := P2 
(B) P1:= 100 
(C) P1* := P2 
(D) PIES E2 


(E) P1* := nil 
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5. Which of the following statements, when used as the body of the 
; function definition 


FUNCTION fact(n:integer) : integer; ` i 
BEGIN -o 3 


. z 3 5 
END; 


will enable that function to compute n! correctly for any n > 0, 
where n! = n#(n-1)#(n-2)%...%3%2"1 ? 


I. fact t= n®fact(n=1) 


IL: EP n- 422 TBEN 
facts 
ELSE 
fact := n*fact(n-1) 


TIL} < IF. nie THEN 
fact <= 
ELSE 
fact t= fact(n+1)/(n+1) 


(A) I only ; 
(B) II only © ae | j (| 
(C) I and II only s : ; 
(D) II and III only : 

(Er i sal, and: ta : i 


6. What output is produced by the following program? 


PROGRAM abc; 
VAR 
n : integer; 


PROCEDURE increment (VAR a, b : integer); 
BEGIN 


BEGIN 

Mes S 

increment (n,n) 3 

write(n) ; . 
END. 


(A) 5 (B) 4 (Cc) 3 (D) 0 
(E) An error message 
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For which of the following applications would magnetic tape provide 
a better medium for file storage than a disk? 


Ur 


I. Transfer of files between computer systems 


II. Archival storage of files 
III. Storage of files requiring direct access 


(A) None 

(B) I only 

(Cc) I and II only 
(D LE: and LIC Case 
(E) 1, °kEy ane iit: 


Base your answers to questions 6-8 on the following binary search routine. 


First:=l; 
last:=N; 
middle:=A[(Firstt+last)div 2]: : ; 
while (A[middle] < > key) and (first <=last) do 
Begin 4 
if key < Al[middle] 
then last:= middle -1 
else first:= middle +l; 
middle := A[(first+last)div 2] 


end; 
6. If n= 20 and key is at position 19 how many times is A[middle] 
compared before key is located. 
7. If key is located at positions 8 and 9 which element, is found by 
this routine. 3 
8. Select one of the following: 
(A) it requires more comparisons to locate the key at position 20 
than it does at position l. 
(B) it requires more comparisons:to locate the key at position 1 
than it does at positior 20. 
(C) it requires the same number of comparisons to find the key at 
position 1 than it does at 20. 
9. Assuming that rand is a function that returns a random real value 


such that -0 < rand < 1 write a statement that would assign a 
random value between, 25 and 50 to test. 
test:= : 
aoe: ` 


10. Given the following program segment what is in the file at the end of 
execution. oes 


for i:= 1 to 3 do 
begin 
rewrite(f1); 
writeln(fl,sqr(1)) 
end; 


-21 





10. Consider the following sequence of procedure calls: 


11. 


-popped twice, (2) the two popped items to be added or multiplied, 


Push(x) 5 
Push(y); 
Add; 
Push(z) 5 
Push(w); 
Mult; 
Add; 


Invoking Push causes its argument to be pushed onto a. stack. 
Invoking the procedures Add or Mult causes (1) the stack to be 


and (3) the result to be pushed onto the stack. If x = 10, y= 
20,-2 = 30, and we 40, then at the end of the sequence of 


procedure calls above, the stack contains 


(A) nothing (B) 0 (c) 940 (D) 1230 (E) 1410 


Suppose that the variable d represents the number of dollars ina 
pank account after interest has just been eredited to that 


account (e.g.-, d = 123.456). Which of the following Pascal- 


expressions would round that amount to the nearest cent (e.g., to 
123.46)? 


(A) round(100*d)/100 
(B) round(d/100)#100 
(c) round(100*d) 

(D) round(d/100) 

(£) round(d) 
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A.P. Computer Science 


Practice A.P. Exam #1 


Section I: Multiple-Choice Questions 


- Section I consists of 50 questions. ‘Following are the directions 


for Section I and a representative set of z4 questions. ` (Answers 
are given on page 31.) i 


In this version of the sample questions, exponentiation will be 
denoted by the u*n symbol, rather than by the usual 


‘superscripting for mathematics tests, ¢-&- 
b^n 


will denote the nth power of b. However, there is no standard 
operation of exponentiation in Pascal, and the n^n symbol will 
have its standard Pascal meaning whenever it appears within 


Pascal code, 


Directions: For each of the following questions, decide which is 
the best of the choices given and -blacken the corresponding space 
on the answer sheet. You may use any available space for 
seratchwork; however, no credit will be given for anything 


“written in the examination booklet. Do not spend too much time 


on any one question. 


The XYZ Freight Company charges its customers for a shipment 
according to the following scale: 


- 4) $70 per ton for shipments weighing 10 tons or less; 
44) $50 per ton for shipments weighing more than 10 tons but not 


more than 20 tons; . 
4ii) $40 per ton for shipments weighing more than 20 tons. 


Which of the following is a reasonable set of test data for a 
program that accepts shipment weights as input and that outputs 


‘the charges for each of these shipments? 


4 


(A) The weights 5, 15, 25 

(B) The weights 0, 10, 20, 30 

(C) The weights 1, 11, 21,31 

(D) The weights 1, 10, gree es ae 
(E) The weights 5, 10,. 15; 20, 251-505 35, 40, 45 





13. 


14. 


The intent of the following algorithm is to delete the last node 
from a singly linked list. The variable first points to the 
first node in the list, if there is one, and has the value nik 


otherwise. 


TYPE node = RECORD 


datum : integer; 
link : “node; 
END; 


VAR p, q, first : “node; 


Which of the following describes the class of all linked lists 
for which this algorithm works correctly? 


(A) No linked lists 

(B) All nonempty linked lists. 

(C) All linked Lists with more than one entry 

(D) The empty list and all linked lists with more than one 


entry 
(E) All linked lists 


What is the askana number of nodes ic a pres tree of height -k? 
(A) 27 (k-1) 

(B) 23k =- E 

(C) 2°k 

(D) 2k- 1 


(E) 2°(k-1) - 1 


Suppose that items X1, X2, X3, X4, and X5 are pushed, in that 
order, onto an initially empty stack S, that S is then popped 
four times, and that as each Xi is popped off S, it is - inserted 
into an initially empty queue. If one Xi is then deleted from 
the queue, what is the next item that will be deleted from the 
queue? : 


? 


(A) YA (a): XS tes x5 (D) N (E) x5 
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3. A variable that is declare 


Questions 3-4 are based ona program with the. follow 
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| PROCEDURE C; H 
i i 


PROCEDURE D; 
BEGIN 
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is accessible in 


(A) 
(B) 
(c) 
(D) 
(E) 


all of program A 

both procedures B and C, but not in procedure D 
procedure B, but not in procedures C and D 
procedures B and D, but not in procedure C 
procedures B, C, and D, but not elsewhere in A 


Suppose that the progranm A contains no goto statements and 


procedure 
following 


procedures contained in 


(A) 
(B) 
(C) 


ealls other than those indicated. Which of 


ing structure. 


d in procedure B and only in procedure B 


no 


the 


lists describes completely the order in which the 


program A are called or invoked? 
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17. 


18. 


19. 


Questions 17-18 are based on the following information. 


Suppose that L is’ a list of distinct numbers arranged in 


increasing order and that a binary search is to be used to` 


determine whether some number x occurs in this list. 


If L contains 127 entries and x actually occurs as the 80th 
entry of L, how many entries from L will be examined by the 
search? 


(a). 1 | ago 40) Sete Ae ee ter 


If L contains 26 entries, what is the average number of entries 
in L that must be examined by the search? 


(A) 26 (83-113 (c) 6 (D): 55 (£). 4 


Ee ee aan ee a ial ee ow gee ane me (Se Deere Si Seeger cen 


Suppose one wishes to determine the second smallest number in a 
list of four distinct numbers by comparing numbers in that list. 
What is the maximum number of comparisons that will be made by 
the most efficient algorithm for this task? 


(A) Fewer than 3 


(B) 3 
(c) 4 
(D) 


5 
(E£) More than 5 


aA 
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Questions 7-8 are based on the following declarations. 


TYPE 


AgeType = «0.125; 
SexType = (M, F); 3 \ 
PartyType~ = (Republican, Democrat, Independent, Other); 
CitizenType = RECORD 
: $ age : AgeType; 
- gex : SexType; 
party : PartyType 
END; - 
VAR eee A 2a 7 
citizen : CitizenTyps; 
longevity : AgeType; 


7. Which of the following is a valid Pascal statement? 


(A) 
(B) 
(c) 
(D) 
(E) 


writeln(citizen.age) 
writeln(citizen.sex) 
writeln(citizen) 
writeln(age) 
writeln(sex) 


8. If G is a function whose definition begins with 


FUNCTION G(x, y : AgeType) : AgeType; 


which of the following is a valid Pascal statement? 


(A) 
(B) 
(c) 


9. Which of 


for a program that simulates the op 
to determine the 


w a o|o œ m = = 


G 

G(1,3) 

read(G(1,3)) 
writein(G(65,longevity)) 
longevity := G(5,citizen.party) 


a. on, i one, aa eae ain ee Oe 


res would be most appropriate 
eration of: an airport in order 
maximum delays encountered by arriving and 


the following data structu 


departing planes? 


(A) 

: (B) 

t (c) 

2, (D) 
(E) 
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24. 
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A retail merchant wishes to keep an inventory of items identified 
by six-digit identification numbers. The quantity of each item 
ranges from 0 to 1000 and must be updated each time that item is 
bought or sold. New identification numbers are created, and old 
ones discontinued, on a regular . basis. How might _one best 
maintain the inventory if one wishes to economize on both storage 
and processing time? 


(A) Use a single list A, letting A{n] be the quantity of item n, 


(B) Use a single list A, letting Aln] be the quantity of itém 
2%n-1 plus 1000 times the quantity of item 2*n. 


(C) Use two lists A and I, placing identification numbers in I in 
the crder in which they are encountered and letting Ain] be 
the quantity of item Tini: 


(D) Use two lists A and I, placing identification numbers in I in 
increasing numeric order and letting A[n] be the quantity of 
item I[n]. Sets . 


(E) Use two lists A and I, placing identification numbers. in I in 
the first available location beyond that indicated by a 
hashing function, and letting A[n] be the quantity of item 


Iln]. 


Which of the following would be the most likely reason for 
building file backup procedures into a computer system offering 
services to many different users? 


(A) To serve the public's right to know 

(B) To enable supervisors to monitor system use 

(C) To improve processing speed ; 

(D) To facilitate the interchange of information 

(E) To allow recovery fron system and/or user errors 


For which of the ‘following applications would magnetic tape 
provide a better medium for file storage than a disk? 


p Transfer of files between computer systems 
II. Archival storage of files 
III. Storage of files requiring direct access 


(A) None . 
(B) I only 5 
(Cc) I and II only 

(D) IL and 1II only 
(E) I, II, and III 


12. Suppose that the following progran segment is used to approximate 
a zero of the real-valued function 


0.001? 


(A) 
(B) 
(c) 
(D) 
(E) 


f starting with left 


f(x) xix -2 


1 and. right = 2. 


VAR 
left, right, x, epsilon : real; 


REPEAT 
x i= (left+right)/2; 
IF f(x) < 0 THEN 
Terts 
ELSE . 
right <= x 
- UNTIL (right-left) < epsilon 


. Bow many times must the loop be executed to produce an x which is 
guaranteed to be within epsilon of a zero of f when epsilon is 


Once 

10 times 

100 times 

1,000 times ; 

It cannot be determined from the information given. 





` The tree above is traversed using preorder 
traversal, and each time a node is visited, the 
label for that node is printed. Which of the 
following would be the resulting output? 


(A) atbec 
(B) abc*#+ 
(C) +a*be 
(D) a+ be 
(E) +» abe 


- function Result (m, n : integer) : integer ; 


var ` 
j, Temp : integer ; 
begin 
Temp :=1; 


forj:= 1 ton do 
Temp := Temp* m; 
Result := Temp 


end ; 


Of the following, which best indicates the task 







performed by the function above if only positive 


parameters are passed to the function? 


(A) lIt approximates the mth root ofn. 
(B) It approximates the nth root of m. 
(C) It computes the mth power of n. 
(D) It computes the mth power of'm. 
(E) It computes the roots of a quadratic with 
positive coefficients and with leading 
coefficient 1. 
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16. A&A certain binary tree T is stored as a two-dimensional 5x3 array 
K, with rows of A corresponding to nodes of T. The columns of A 
contain the following information: . af 

column 1 -- the row index of the left child 
column 2 -- the value stored at the node 
column 3 -- the row index of the right child 


A row index of 0 indicates a nonexistent child. If A consists of 
the entries a ee at 


Owo- 
oOraon 
oooor 


then T is given by whick of the following diagrams? ` 


(A). 62 = =) oe 0 
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Each record in a mailing list contains a name, 
address, social security number, and possibly 
some pointer or integer fields for linking 
records. The list is very seldom changed. There 
are two operations to be performed frequently: 
printing mailing labels ordered by zip code, and 
finding a particular record given its social 
security number. Assume that efficiency in both 
space and time is important. 


Of the following, which is the best way to store 
the mailing list records in order to solve the 
problem above? 


(A) Using two arrays, one ordered by social 
security number and one by zip code 

(B) Using one array, ordered by social security 
number, with records linked in zip code 
order 

(C) Using two linked lists, one ordered by 
social security number and one by zip 
code : 

(D) Using one array, ordered only by social 
security number Ree. 

(E) Using one array, ordered by zip code and 


by social security number within each zip 
code | ' 


function 2(k,n : integer) : integer ; 


begin 
ifn = k then 
Ze 
else 


if n > k then 
zi 2(k,n-k) 
else 
z i= 2(k—n,n) 


end ; 


Based on the function defined above, what is the 
value of z(6,8) ? 


(A) 1 
(B) 2 
(CE3 
(D) 4 
(E) 8 
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20. 


21. 


20-21 refer to the following procedure which sorts 15 real 


numbers from smallest to largest by performing an insertion sort. 


CONST 


n= 15; 


TIPE 


collection = array [1..n] of real; 
PROCEDURE sort (VAR a:collection); 


VAR 
‘4, j : integer; 
.temp : real; 


BEGIN 
FOR i := 2 TO n DO 
BEGIN {start pass i} 
temp := ali]; 
5 ’ 
-- WHILE (j > 1) and (alj=1} > temp) DO 
BEGIN 
alj] := a[lj-1]; 
§ t= jel 
` BND; {end while} 
alj] := temp 
END {end for} i 
END; ; : ° 


.Which of the following assertions is an invariant of the 


for-loop, i.e., which is true at the beginning of each pass 


through the loop? 


@) alji<alk) if 1483 5k S40 
(8s) a{j] < afk] if 1 Ss ae ee Se 
(cy at $)< alk) “Sr beds i <¢kig 1 
(p) aj) < alk] if 4534686 i 
,e4 te ot 


(Œ) alJ] < alk) if 


In the program above, the number of times the condition 
a[j-1] > temp 
is evaluated grows proportional to what function of n? 


(a) be- (B). nt O eeo (D) 2n (E) n^2 


mg 
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10. Let Rnd be a function that returns a random value uniformly 


distributed in the range 0 S Rnd < 1. Of the following 
expressions, which best represents the product of the outcomes 
of three unbiased dice cach rolled independently? 


(A) 18 * Rnd 

(B) trunc (6 * Rnd + 1) + sgr (6 * Rnd + 1) 

(C) round (6* Rnd + 1)* sqr (6 * Rnd + 1) 

(D) trunc (6 * Rnd + 1) * trune (6 * Rnd + 1) * trune (6 * Rnd + 1) 
(E) round (6 * Rnd + 1)* round (6 * Rnd + 1) * round (6 * Rnd + 1) 


~ 


. An operator is “right-associative” if successive 


appearances of the operater result in operands 
being combined right to left. For example, the 
expression - 


aoe boc 


would mean a ° (b ° c) if the operator ° is right- 
associative. What is the value of the expression 


32-16/8-4/2 


if all operators are right-associative and ifthe. 
operation of subtraction has higher precedence 
than that of division? 


(A) 2 
(B) 8 
(C) 28 
(D) 29+ 1/3 
(E) 32 
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Suppose an input list consists of the numbers 1, 
2, 3, 4, 5, © and there are three operations that 
can be performed, using only one stack S: . 


i. Copy next input to output list. 
ii, Push next input onto S. 
iii. Pop Sand output the popped integer. 


Which of the following is NOT a possible 
output list that could be achieved using these 
operations? 


(A) 123456 
(B) 654321 
(C) 246531 
(D) 345612 
(E) 125643 


_ After the program segment 


fori:= | to 3 do 
begin 
rewrite(Outfile) ; 
write( Outfile, i*i) 
end 
is executed, which of the following indicates the 
contents of Outfile? 


(A) 111 
(B) 1 
(C) 9 
(D) 123 
(E) 149 


_ Which of the following is a good reason for the 


declaration of user-defined types within a 
program? 


(A) It causes programs to execute more rapidly. 


(B) It conserves space in primary and 
„secondary memory. 

(C) It reduces compilation time. - 

(D) It makes a program easier to understand in 
terms of the problem to be solved. 

(E) It eliminates the need for procedures and 
functions. 
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USE THIS SPACE FOR SCRATCHWORK. 


23. Which of the following is a good reason for 
including global variables in a program? 


(A) To reduce the possibility of unwanted 
changes in variable values 3 

(B) To restrict intermodule communication to 
a few lines of code 

(C) To make programs easier to modify 

_ correctly 
(D) To bean aid to program reliability 
(E) None of the above 


24, Which of the following is the ordering of storage 
media, from fastest to slowest, in terms of the 
time required to access an item at a random 
location on the storage medium? 


(A) Magnetic tape, hard disk, semiconductor 
memory 

(B) Magnetic tape, semiconductor memory, 
hard disk 

Semiconductor memory, hard disk, 

magnetic tape 

(D) Semiconductor memory, magnetic tape, 
hard disk 

(E) Hard disk, semiconductor memory, 
magnetic tape 
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Questions 27-28 are based on the following informa- 


tion. 


Grades for students in a class are stored ina 


sequential file of type text. Each line of that file 
contains a student's last name and that student's 
scores on three exams. All names are unique and all 
items on a line are separated by spaces. For example, 


Ce 


28. 


Adams 85 96 92 
Jefferson 90 93 -89 


To compute the average grade for a student ` 
named Smith, a program must read 


(A) the entire file 

(B) all lines in the file up to and including the 
lines containing Smith’s name, but no 
other lines 


. (C) the line in the file containing Smith’s name, 


but no other lines - 
(D) those lines in the file that would be en- 


countered by a binary search for Smith’s - 


nante, but no other lines 

(E) the lines in the file starting with Smith’s 
name and continuing to the end of the 
file, but not any lines that come before 
Smith's name 


A program that is to record a change ina file to 
reflect a change in the grade on the second exam 
of a student named Smith must write a line 
containing Smith's name and copy 


(A) all other lines of the file 


(B) ali lines in the file up to Smith’s name, but 
no other lines 

(C) all lines in the file after Smith’s name, but 
no other lines 

(D) those lines in the file that would be 
encountered by a binary search of 
Smith's name, but no other lines 

(E) no other lines of the file 


l-h h 
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var USE THIS SPACE FOR SCRATCHWORK. 


s : array [1. .Length] of char ; 
p.i : integer ; 
Not Done : boolean ; 
begin 

for i := | to Length do 

read(s[i]) ; 
Ipe 

Not Done := true ; 

while (p <= Length) and (Not Done) do 
if s[p] =' ‘then 

Not Done := false 


else 
prep 
for i := p — | downto | do 
write(s[i)) 
end ; 


Let K denote a blank. In the program segment 
above, if Length = 10 and the input is 


John#Smith 
what is the output? . 
(A) Smith 

(B) John 

(C) BnhoJ 


(D) nhoJ 
(E) An out-of-bounds error message : 
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34. 


32 


Merge sort (internal) has which of the following 
advantages over bubble sort for long lists? 


I. Merge sort requires much less coding to 
implement than does bubble sort. 
ll. Merge sort runs faster than bubble sort. 
Ill. Merge sort requires less storage space 
than bubble sort. 


(A) I only 
(B) II only 
(C) Ill only 
(D) I and Il 
(E) H and Hl 


A procedure is to be written to search an array 
for a specified item and return its index. The 
problem specifications do not indicate what 
should be returned if the item sought appears in 
more than one place in the array. Which of the 
following actions would be most appropriate? 


(A) The programmer should write the 
procedure assuming that there will be no 
duplicate entries. : 

(B) The procedure should be written so as to 
return the index of all instances of the 
item. A 

(C) The specifications should be modified to 
indicate what should be done if there are 
duplicate entries. 

(D) The procedure should be written so as to 
output an error message if there are 
duplicate entries. - 

(E) The procedure should be written so as to 
delete duplicate entries if any are found. 
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f Questions 21-22 are based on the following program. 


program Ample(input, output) ; 
[l var a: integer ; 


procedure One(z : integer) ; 
begin 
writeln(z,a) 


end; 


procedure Two ; 
var x : integer ; 

begin 
a= 3% 
xp E: 
One(x) 

end ; 


begin {Ample} 
Pes Loe 
<5? WO 
writeln(a) 


y 


end. = 


21, What is written by procedure One when it is 
called by Two? 


(A) 23 

(B) 2 < undefined > 
(C) < undefined > 15 
(D) < undefined > 3 
(E) 215 


_ What is written by the final statement of 
the main execution block? 


(A) 2 
(B) 3 
(Cy 13 
(D) 20 
(E) < undefined > 


N 
N 
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34. The first step in a very efficient search procedure 
to find the pattern 'ABCDE' in an array of 
uppercase characters is to look at characters 
with index 


(A) 1,2,3, ... until an ‘A’ is encountered 

(B) 1,2,3, ... until any letter in 'A'..'E' is 
encountered * 

(C) 5.10,15, ... until an 'E' is encountered 

(D) 5,10,15, ... until any letter in ‘A’ ..'E' is 
encountered 

(E) 5,10,15, ... until any letter in Th An 
encountered 


aa 
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36. Suppose List? and List2 are pointers to the 


first nodes in each of two linked lists, and 

q points to some node in the first list. The initial 
segment of the first list, that is, all the nodes up 
to and inciuding the one pointed to by q, is to be 
removed and this segment put onto the begin- 
ning of the second list while the order of the 
nodes in the initial segment is maintained. If 
neither q nor List/ is nil, then this task is 
correctly performed by which of the following 
program segments, where p is a pointer? 


1. gt.Link := List2 ; 
List2 := List] ; 
List] := qt.Link 
Il. while List] <> qt.Link do 
begin 
p:= Listl; 
List] := Listlt.Link ; 
p!.Link := Bae 
List2 := p 
end 


I. p:=qt.Link ; 
qi.Link := bist? ; 
List2 := List] ; 
List] := p 


(A) None 

(B) Il‘only © 

(C) Land II only 
(D) land HI only 
(E) 1, H, and HI - 


“= 
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99, Ina certain card 


game, cards are distributed 
into seven separate piles. The rules of the game 
permit selecting a card in one pile and moving 
it, together with all cards on top of it, to the top 
of another pile. The cards that are moved are 
not to be reordered. Of the following data 
structures, which would be most appropriate to 
use in simulating this aspect of such a game? 


(A) Seven singly linked lists 

(B) Seven one-dimensional arrays 
(C) Seven stacks 

(D) Seven queues 

(E) Seven sets 


30. program Main ; 


var z : integer ; 
function F(x : integer) : integer ; 


begin 
if (x = 1) or (x = 3) then 
y ae 
else 
F:= x+ RAx-!). 
end ; 
begin {Mein} 
z:= A R2) + R5)) 
end. 


If maxint were large enough to allow the 
program above to be executed, then at the 
end of the program, the value of z would be 


(A) 62 

(B) 5!+2! 
(C) (5! +21)! 
(D) (7°)! 

(E) (62!)+(2!) 
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38. 


39. 


40. 


Suppose List is a variable of type | Node that 
points to the first node of a linear linked list of 
elements of type 


Node = record 
Datum : integer ; 
Link :1Node 


end 


Assume that the list is implemented without a 
header node. If the list is empty, which of the 
following conditions is both meaningful and 
true? 


(A) List = nil 

(B) Listt.Link = nil 
(C) Listt.Link = List 
(D) List =0 

(E) Link = nil 


It would be most appropriate to use a recursive 
function or procedure to solve a problem that 


(A) can be reduced to two, or more, simpler 
or smaller cases of the same problem 

(B) involves a substantial number of condi- 
tionals and nested loops 

(C) requires a lot of memory 

(D) involves storing data in a two-dimensional 
array 

(E) involves evaluation of the factorial function 


Consider the following four tasks: 


l. To perform a linear search of a list of 
n names 

2. To perform a binary search of a sorted 
list of n names 

3. To perform a selection sort into alpha- 
betical order of a list of n names that are 
initially in random order 

4. To perform a merge sort into alpha- 
betical order of a list of m names that are 
initially in random order 


For jarge n, which of the following lists these 
tasks in order (from least to greatest) of their 
worst-case running times? 
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33. A rectangular array is used to represent a maze. : 


Cells containing ones represent walls; cells 
containing zeros represent rooms that can be 
entered from neighboring rooms; i.e., from cells 
that adjoin them horizontally or vertically (but 
not diagonally). The cell at the upper left and 
the cell at the lower right each contain a Zero. 
Which of the following describe(s) a correct 
method of determining whether there is a path 
through the maze from the upper left to the 
lower right? 


l. Place a two in the upper left cell. Then 
repeatedly place a two in each cell that 
contains a zero and adjoins a cell 
containing a two. If a two ever appears 
in the lower right cell, then a path exists 
through the maze. 

Il. Move horizontally from the cell in the 
upper left corner of the maze until a wall 
or the edge of the array is reached. Then 
turn right and move until another wail 
or another edge is reached. Continue in 
this fashion, turning right each time an 
obstruction is reached, until either the 
lower right cell is reached (in which case 
there is a path through the maze) or the 
upper left cell is again reached (in which 
case there is not). 

III. Consider separately the one or two 
moves that are possible from the upper 
left cell (if no moves are possible, there 
is no path). In each case, reduce the size 
of the array by eliminating the row and 
column of the cell just left. Apply this 

method recursively until no moves are 
possible or until the lower right cell is 
reached. 


(A) I only 
(B) H only 
(C) HI only 
(D) Iland I 
(E) Iland I 
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Questions 44-45 are based on the following information. 


Newton's method for finding the square root ofa 
number A involves compuung successive approxima- 
tions x, to the square root of A using the formula 


x ere (A/x,+ x) 


where Xo is an initial estimate. 


44, Ina program implementing Newton’s method, a 
programmer forgot to include a test that would 
check for an initial estimate of zero. If the initial 
estimate is zero, when will the error be detected? 


(A) At edit time 
(B) At compile time 
(C) As soon as the value of A is entered 


(D) During execution of the program 
(E) When an incorrect result is output 


45. THIS ITEM 
NOT SCORED 
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“IF YOU FINISH BEFORE TIME IS CALL 


ED, YOU MAY CHECK YOUR WORK ON THIS = 
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erts the integers one at a time into an 


hat, after each insertion, an inorder traversal would always visit the 


s used to sort the list of seven integers 1, 2,. + 7 (which happens 
he binary tree that would be generated? 


ing generated is that the latest tree 


35, A particular algorithm uses a binary tree to sort a list of integers: It ins 


initially empty binary tree in such a way t 
nodes in increasing order. If this algorithm i 
to be in increasing order already), which of the following represents t 


(Assume that the only difference between successive versions of the tree be 
has the latest inserted value as a leaf.) 
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Question 37 is based on the following program. 


program Convert(input,outpul) ; 


const 
Base = 16; 


var 


Ch : char ; 


Num, Dig : integer ; 


begin ` 
Num :=0; 
while not eo/n do 
begin 
read(Ch) ; 


if Ch in ['0". .'9'] then 
Dig := ord(Ch) — ord('0') 


else 
case Ch of 


'A' : Dig: 
'B' : Dig: 
‘C': Die: 
'D' : Dig: 


'E' : Dig 
'F' : Dig 
end ; {case} 


10; 
Il; 
12; 
1B; 
i4: 
15 


Num := Num* Base + Dig 


end ; {while} 
writeln( Num) 
end. 


37, Of the following, which best describes the action 


of the program? 


(A) Given a base 16 number as input, the 
program writes the number in base 10. 
(B) Givena base 10 number as input, the 
program writes the number in base 16. 
(C) The program writes '‘ABCDEF’. 


(D) The program counts the characters in the © 


input that are in the set[‘A’ .. 'F']. 
(E) Given a base 16 number x as input, the 
program writes the value of 16* in 


base 10. 
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Let x and y be variables of type real with only USE THIS SPACE FOR SCRATCHWORK. 


positive values. Of the following, which best 
describes the conditions under which the 
Boolean expression, x + y = x, can have the 
value true? 


(A) Only when y >x 


(B) Only wken y < 

(C) Only when x is much greater than y 

(D) Only when the computer has 16-bit words 
(E) It can never have the value true. 


while (i <= Max) and (String[i] <> Symbol) doi:= i+ 1 


Which of the following is a loop invariant for the while loop above, 
ie., which is true each time the while-condition is tested? 


(A) i= Max 
(B) i=itl 
(C) String[j] = Symbol for all j such that i <J. 


- (D) Stringlj] * Symbol for all j such that i Sj. 


43. 


(E) String{j] ~ Symbol for all j such thatl Sj <i. 


Customers using an automatic teller to 
withdraw money from the XYZ National Bank 
must supply both their nonsecret account num- 
ber and their secret code number. The automatic 
teller is controlled by a computer program, one 
of whose subprograms can “scramble” any num- 
ber n to produce a value s(n). The scrambling: 
function s has the property that its inverse t (i.¢., 
that function ¢ such that (s(n)) = s((n)) = 2 
for all n) is very difficult to compute. Which of 
the following information about the account 
number A and the code number C for each 
customer should the automatic teller store in its 
data base (which is not very secure) in order to 
provide best protection against unauthorized 
withdrawals? 


(A) A,C 

(B) A,s(C) 
(C) A,s(t(C)) 
(D) s(A),C 
(E) #(A),C 
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